Stars
Produce redistributable builds of Python
A zero-dependency, high-performance Khmer word segmenter using the Viterbi algorithm. Optimized for dictionary accuracy, ultra-low memory footprint, and edge deployment.
A spaCy model for Esperanto, trained to provide high-quality linguistic annotations.
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Freeze (package) Python programs into stand-alone executables
Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva/ Datadadome / CloudFlare IUAM)
Unofficial Python security updates for Windows
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
How to train a Tibetan language model for SpaCy
State of the Art Natural Language Processing
A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks
An extremely fast Python linter and code formatter, written in Rust.
བོད་ཐོག BoTok custom dialect pack for modern Tibetan
Python stemming library using snowball stemmers
Open-source offline translation library written in Python
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
BNLP is a natural language processing toolkit for Bengali Language.
VADER Sentiment Analysis. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based sentiment analysis tool that is specifically attuned to sentiments expressed in social …
A feature-rich dictionary lookup program, supporting multiple dictionary formats (StarDict/Babylon/Lingvo/Dictd) and online dictionaries, featuring perfect article rendering with the complete marku…
Lao language Natural Language Processing toolkit
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
no-plagiarism / pymorphy3
Forked from pymorphy2/pymorphy2Morphological analyzer / inflection engine for Russian and Ukrainian languages.
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency