Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View hulongji's full-sized avatar

Block or report hulongji

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
72 stars written in Python
Clear filter

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 78,623 15,133 Updated May 10, 2024

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。

Python 41,669 4,127 Updated Nov 20, 2025

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Python 37,353 6,269 Updated Jul 26, 2024

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

Python 36,107 10,909 Updated Nov 15, 2025

结巴中文分词

Python 34,718 6,723 Updated Aug 21, 2024

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 33,097 4,634 Updated Nov 27, 2025

Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

Python 19,969 4,646 Updated Jan 22, 2026

The interactive graphing library for Python ✨

Python 18,208 2,774 Updated Jan 14, 2026

Network Analysis in Python

Python 16,546 3,456 Updated Jan 23, 2026
Python 16,357 1,551 Updated Jan 23, 2026

Topic Modelling for Humans

Python 16,333 4,413 Updated Nov 1, 2025

Parallel computing with task scheduling

Python 13,728 1,836 Updated Jan 23, 2026

100+ Chinese Word Vectors 上百种预训练中文词向量

Python 12,173 2,331 Updated Oct 30, 2023

A little word cloud generator in Python

Python 10,502 2,335 Updated Jan 22, 2026

Declarative visualization library for Python

Python 10,222 833 Updated Jan 21, 2026

code for Data Science From Scratch book

Python 9,460 4,708 Updated Nov 9, 2023

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Python 8,469 602 Updated Nov 1, 2025

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python 7,717 934 Updated Jan 24, 2026

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 7,346 876 Updated Jan 21, 2026

Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts

Python 7,096 414 Updated Jan 21, 2026

Python library for processing Chinese text

Python 6,602 1,365 Updated Jan 19, 2020

Language Technology Platform

Python 5,230 1,057 Updated Jun 2, 2025

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,227 547 Updated Sep 8, 2025

Bili23 Downloader 是一款跨平台(Windows/Linux/macOS)的 B 站视频下载工具,支持下载 B 站投稿视频、番剧、电影等类型视频。支持多线程加速、断点续传等特性,搭配图形化界面与零配置操作,提供高效便捷的下载体验。

Python 3,381 238 Updated Dec 24, 2025

Top2Vec learns jointly embedded topic, document and word vectors.

Python 3,108 376 Updated Nov 14, 2024

The machine learning toolkit for time series analysis in Python

Python 3,086 365 Updated Jan 23, 2026

A Python library that helps data scientists to infer causation rather than observing correlation.

Python 2,426 277 Updated Jun 26, 2024

An Efficient Lexical Analyzer for Chinese

Python 2,092 336 Updated Jan 31, 2022

A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.

Python 1,710 251 Updated Nov 17, 2025
Next