Stars
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
💫 Industrial-strength Natural Language Processing (NLP) in Python
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
The interactive graphing library for Python ✨
100+ Chinese Word Vectors 上百种预训练中文词向量
A little word cloud generator in Python
code for Data Science From Scratch book
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
Python library for processing Chinese text
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Bili23 Downloader 是一款跨平台(Windows/Linux/macOS)的 B 站视频下载工具,支持下载 B 站投稿视频、番剧、电影等类型视频。支持多线程加速、断点续传等特性,搭配图形化界面与零配置操作,提供高效便捷的下载体验。
Top2Vec learns jointly embedded topic, document and word vectors.
The machine learning toolkit for time series analysis in Python
A Python library that helps data scientists to infer causation rather than observing correlation.
An Efficient Lexical Analyzer for Chinese
A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.