Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Aashit-Sharma's full-sized avatar
🧠
🧠
  • Hong Kong

Block or report Aashit-Sharma

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,822 2,670 Updated Jul 3, 2025

The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.

Python 180 19 Updated May 13, 2022

[NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks

Jupyter Notebook 60 9 Updated Nov 24, 2022

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…

Jupyter Notebook 2,062 176 Updated Aug 15, 2024

Course materials for Dartmouth course: Human Memory (PSYC 51.09)

TeX 264 14 Updated May 3, 2025

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,610 531 Updated Oct 16, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 64,227 6,517 Updated Sep 19, 2025

multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.

Python 372 52 Updated Nov 21, 2022

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

Shell 4,755 1,294 Updated Nov 8, 2025

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

Python 771 104 Updated Jul 22, 2025

Implementation of Marge, Pre-training via Paraphrasing, in Pytorch

Python 76 11 Updated Jan 14, 2021

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

Python 35,840 10,842 Updated Oct 19, 2025

Fuzzy string matching, grouping, and evaluation.

Python 784 71 Updated Jul 10, 2025

⚫ A spaCy pipeline and model for NLP on unstructured legal text.

Python 663 105 Updated Jul 16, 2024

甲言,专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包,支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon construction, tokenizing, POS tagging, sentence segmentation a…

Python 629 70 Updated Nov 2, 2021

The world's cleanest AutoML library ✨ - Do hyperparameter tuning with the right pipeline abstractions to write clean deep learning production pipelines. Let your pipeline steps have hyperparameter …

Python 614 63 Updated May 4, 2025

Library for Knowledge Intensive Language Tasks

Python 956 90 Updated Mar 31, 2022

Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers

Jupyter Notebook 160 24 Updated Sep 25, 2020

🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code

Python 2,817 233 Updated Jun 23, 2023

A relation-aware semantic parsing model from English to SQL

Python 442 121 Updated Aug 22, 2023

Top2Vec learns jointly embedded topic, document and word vectors.

Python 3,094 377 Updated Nov 14, 2024

DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks

Python 1,267 135 Updated Mar 2, 2023

Python library for building highly effective data science workflows

Python 948 73 Updated Jul 20, 2023

Pytorch library for fast transformer implementations

Python 1,749 188 Updated Mar 23, 2023

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,321 2,131 Updated Oct 27, 2025

A set of methods for finding an appropriate number of topics in a text collection

Python 16 4 Updated Apr 14, 2025

Text preprocessing, representation and visualization from zero to hero.

Python 2,910 239 Updated Aug 29, 2023

Google search from Python (unofficial).

Python 1,232 405 Updated Apr 27, 2024

Trax — Deep Learning with Clear Code and Speed

Python 8,295 828 Updated Sep 26, 2025
Next