Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View carmen852's full-sized avatar

Block or report carmen852

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Turn your data frame into a tableau style drag and drop UI interface to build visualization in R.

TypeScript 537 53 Updated Jul 3, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,425 3,893 Updated Jul 23, 2024

Multifarious Scrapy examples. Spiders for alexa / amazon / douban / douyu / github / linkedin etc.

Python 3,262 1,032 Updated Nov 3, 2023

Visual scraping for Scrapy

Python 9,482 1,398 Updated Jun 26, 2024

Scrapy, a fast high-level web crawling & scraping framework for Python.

Python 59,580 11,220 Updated Jan 23, 2026

Web Scraper in Go, similar to BeautifulSoup

Go 2,226 169 Updated Nov 2, 2023

Linguistic Inquiry and Word Count (LIWC) analyzer

Python 233 52 Updated Dec 20, 2021

Obsidian Plugin for social-scientific Qualitative Data Analysis (QDA). An open alternative to MAXQDA and atlas.ti, using Markdown to store data and research codes.

TypeScript 134 4 Updated Jan 7, 2026

Gephi - The Open Graph Viz Platform

Java 6,346 1,593 Updated Jan 11, 2026

💫 Models for the spaCy Natural Language Processing (NLP) library

Python 1,839 312 Updated May 27, 2025

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 33,127 4,636 Updated Nov 27, 2025

Python脚本。模拟登录知乎, 爬虫,操作excel,微信公众号,远程开机

Python 10,494 4,267 Updated Oct 10, 2023

All Algorithms implemented in Python

Python 217,282 49,999 Updated Jan 25, 2026

An R package for the Quantitative Analysis of Textual Data

R 871 190 Updated Jan 29, 2026

用于训练中英文对话系统的语料库 Datasets for Training Chatbot System

Python 2,054 494 Updated Sep 23, 2020

自然语言处理,知识图谱相关语料。按照Task细分,欢迎PR。

Python 731 155 Updated Jan 15, 2021

A multilingual dialog corpus

Python 1,412 1,161 Updated Nov 8, 2025

搜集、整理、发布 中文 自然语言处理 语料/数据集,与 有志之士 共同 促进 中文 自然语言处理 的 发展。

Jupyter Notebook 6,458 1,427 Updated Jan 29, 2019

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,856 1,557 Updated Sep 8, 2025

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 171,882 54,252 Updated Jan 29, 2026

SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.

Python 7,630 626 Updated Nov 7, 2025

Collection of various algorithms implemented in R.

R 1,097 342 Updated Oct 26, 2025

A Zotero plugin for syncing items and notes into Notion

TypeScript 3,017 125 Updated Jan 2, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 155,888 31,892 Updated Jan 28, 2026

Implementation of BERT that could load official pre-trained models for feature extraction and prediction

Python 2,428 508 Updated Jan 22, 2022

Transformer related optimization, including BERT, GPT

C++ 6,390 928 Updated Mar 27, 2024

Google AI 2018 BERT pytorch implementation

Python 6,515 1,329 Updated Sep 15, 2023

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 7,358 877 Updated Jan 28, 2026

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。

Jupyter Notebook 17,455 4,651 Updated Jan 9, 2026

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 78,723 15,139 Updated May 10, 2024
Next