Stars
Creating beautiful plots of data maps
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
State-of-the-Art Text Embeddings
Fuzzy string matching, grouping, and evaluation.
pepy is a site to get statistics information about any Python package.
Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a document-keyphrase matrix.
A Python nearest neighbor descent for approximate nearest neighbors
A high performance implementation of HDBSCAN clustering.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.