Markup packages

Showing projects tagged as Text Processing, XML, and Markup

  • xmltodict

    8.0 8.6 L4 Python
    Python module that makes working with XML feel like you are working with JSON
  • trafilatura

    7.6 6.8 Python
    Python package and command-line tool to gather text and metadata on the Web: crawling, scraping, extraction, and output as CSV, JSON, HTML, MD, TXT, or XML
  • lxml

    7.0 9.6 L2 Python
    The lxml XML toolkit for Python
  • xhtml2pdf

    6.8 5.4 L1 Python
    A library for converting HTML into PDFs using ReportLab
  • aeneas

    6.6 0.0 L3 Python
    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
  • feedparser

    6.3 7.4 L3 Python
    Parse feeds in Python
  • Atoma

    2.0 0.0 Python
    Atom, RSS and JSON feed parser for Python 3
  • GoBeautifulSoup

    0.3 3.3 Python
    GoBeautifulSoup is an HTML/XML parsing library that provides an API 100% compatible with BeautifulSoup4 but is powered by Go for improved performance. It is designed as a drop-in replacement for BeautifulSoup4.