Textstat

Textstat is an easy to use Python library that analyzes text to provide detailed statistics, readability scores, and complexity metrics. Perfect for content analysis, education, and natural language processing.

^{Photo by Patrick Tomasso
on Unsplash}

Usage

>>> from textstat import Text, Sentence, Word

>>> my_text = Text(
  "Alice was beginning to get very tired of sitting by her sister on the "
  "bank, and of having nothing to do: once or twice she had peeped into "
  "the book her sister was reading, but it had no pictures or "
  "conversations in it, “and what is the use of a book,” thought Alice "
  "“without pictures or conversations?”"
)

>>> my_text.stats()
{'letters': 236, 'characters': 246, 'words': 57, 'sentences': 1}

>>> my_text.flesch_reading_ease()
31.727368421052645

>>> my_text.filter(Word.length >= 10)
[Word('conversations'), Word('conversations')]

For full documentation, see https://docs.textstat.org/

Installation

textstat is available on PyPi and Conda Forge.

pip install textstat

conda install textstat

Name		Name	Last commit message	Last commit date
Latest commit History 478 Commits
.github		.github
.vscode		.vscode
tests		tests
textstat		textstat
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Textstat

Usage

Installation

About

Uh oh!

Releases 25

Uh oh!

Contributors 42

Uh oh!

Languages

License

textstat/textstat

Folders and files

Latest commit

History

Repository files navigation

Textstat

Usage

Installation

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 25

Uh oh!

Contributors 42

Uh oh!

Languages