Text Processing packages

Showing projects tagged as HTTP and Text Processing

  • httpie

    9.7 6.6 L3 Python
    🥧 HTTPie CLI — modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more.
  • Jinja2

    9.0 7.8 L3 Python
    A very fast and expressive template engine.
  • Pattern

    8.8 0.0 L2 Python
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
  • Sphinx

    8.7 9.8 L2 Python
    The Sphinx documentation generator
  • HTTP Prompt

    8.5 0.0 L4 Python
    An interactive command-line HTTP and API testing client built on top of HTTPie featuring autocomplete, syntax highlighting, and more.
  • WeasyPrint

    8.5 9.6 L1 Python
    The awesome document factory
  • Python-Markdown

    7.7 7.4 Python
    A Python implementation of John Gruber’s Markdown with Extension support.
  • trafilatura

    7.6 6.8 Python
    Python package and command-line tool to gather text and metadata on the Web: crawling, scraping, extraction, and output as CSV, JSON, HTML, MD, TXT, or XML.
  • Scrapely

    6.1 0.0 HTML
    A pure-Python HTML screen-scraping library
  • python-user-agents

    5.4 0.0 L4 Python
    A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.
  • selectolax

    5.0 9.3 Cython
    Python bindings to the Modest and Lexbor engines. Fast HTML5 parser with CSS selectors for Python.
  • MarkupSafe

    4.3 7.0 L5 Python
    Safely add untrusted strings to HTML/XML markup.
  • htmldate

    2.3 3.6 Python
    Fast and robust date extraction from web pages, from Python or on the command line
  • PatZilla

    2.2 5.4 Python
    PatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.
  • Kotori

    2.1 2.0 Python
    A flexible data historian based on InfluxDB, Grafana, MQTT, and more. Free, open, simple.
  • Template Render Engine

    1.0 2.4 L4 Python
    Template Render Engine
  • DisCapTy

    0.8 5.4 Python
    DISCONTINUED. DisCapTy is a Python module for generating Captcha images without having to work out how to build your own. Everyone can use it!
  • Doublify API Toolkit

    0.5 0.0 Python
    DISCONTINUED. Doublify API toolkit for Python
  • loggingutil

    0.4 2.1 Python
    A Python logging utility package designed for simplicity
  • GoBeautifulSoup

    0.3 3.3 Python
    GoBeautifulSoup is a high-performance HTML/XML parsing library that provides a 100% compatible API with BeautifulSoup4, but powered by Go for dramatically improved performance. It's designed as a drop-in replacement for BeautifulSoup4 with significant speed improvements.