jonason92
🐳 Checkin' In

Contexts Optical Compression

Python 21,495 1,922 Updated Oct 25, 2025

📝 Automatically annotate papers using LLMs

Python 391 39 Updated Dec 1, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 16,224 1,254 Updated Dec 12, 2025

Get your documents ready for gen AI

Python 47,223 3,325 Updated Dec 19, 2025

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 32,042 2,234 Updated Dec 15, 2025

The headless rich text editor framework for web artisans.

TypeScript 34,120 2,796 Updated Dec 19, 2025

Open-source technology for creating full-stack knowledge applications for communities of all types.

TypeScript 73 9 Updated Dec 18, 2025

CiteSeerX public repository

HTML 134 59 Updated Jun 4, 2024

ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.

C++ 1,362 153 Updated Sep 13, 2023

A complete alternative for Overleaf with VSCode + Web + Git Integration + Copilot + Grammar & Spell Checker + Live Collaboration Support. Based on GitHub Codespace and Dev container.

TeX 1,379 387 Updated May 23, 2024

A hatch plugin to help build Jupyter packages

Python 46 14 Updated Jun 16, 2024

Markdown <=> IPython Notebook

Jupyter Notebook 859 111 Updated Oct 18, 2021

Allow you to access your calibre libraries and read books directly in Obsidian.

TypeScript 183 6 Updated Sep 19, 2023
JavaScript 26 6 Updated Dec 11, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 30,456 2,071 Updated Nov 19, 2025

Convert Word documents (.docx files) to HTML

JavaScript 6,005 640 Updated Nov 20, 2025

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …

HTML 13,439 1,109 Updated Dec 19, 2025

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

Java 3,474 900 Updated Dec 18, 2025

Python tool for converting files and office documents to Markdown.

Python 84,401 4,856 Updated Dec 1, 2025

Use a text editor. Make a PDF.

Python 584 153 Updated Dec 18, 2025

Mdformat plugin for MyST compatibility

Python 16 7 Updated Nov 27, 2025

Interpreter for interactive educational content, written in an extended Markdown format...

Elm 258 37 Updated Dec 16, 2025

Parse PDFs into markdown using Vision LLMs

Python 454 64 Updated Oct 4, 2025

Convert PDF to markdown quickly with high accuracy

Python 1 Updated Nov 11, 2024
PHP 4 3 Updated Jan 15, 2025

Repository for the book Among Digitized Manuscripts by L.W. Cornelis van Lit (Leiden: Brill, 2020)

HTML 24 17 Updated Feb 27, 2020

Quarto JupyterLab Extension

TypeScript 27 6 Updated Aug 19, 2025

The Project Gutenberg tool to generate EPUBs and other ebook formats.

Python 117 25 Updated Oct 30, 2025

📚 Freely available programming books

Python 379,034 65,639 Updated Dec 16, 2025

A machine learning software for extracting information from scholarly documents

Java 4,510 525 Updated Dec 18, 2025