-
midrash_auto_annotate_asc Public
Forked from TAU-CH/midrash_auto_annotate_ascJupyter Notebook UpdatedJan 24, 2026 -
GutenOCR Public
Forked from Roots-Automation/GutenOCROpen-source tools for training and evaluating Vision Language Models for OCR
Python Apache License 2.0 UpdatedJan 22, 2026 -
-
GenizahSearch Public
Forked from gershuni/GenizahSearchAdvanced Search & Analysis Tool for the Cairo Genizah Corpus
Python MIT License UpdatedJan 12, 2026 -
-
custom_d_fine Public
Forked from ArgoHA/D-FINE-segD-FINE: SoTA Object Detection model custom training/exporting/inferencing pipeline from scratch
Python Apache License 2.0 UpdatedJan 3, 2026 -
pangolinos Public
PangoLine on Steroids - Enhanced synthetic document rendering with precise polygon extraction
-
corpora Public
Forked from CopticScriptorium/corporaPublic repository for Coptic SCRIPTORIUM Corpora Releases
CSS UpdatedDec 12, 2025 -
-
rt-detrv4 Public
Forked from RT-DETRs/RT-DETRv4Official implementation of RT-DETRv4: Painlessly Furthering Real-Time Object Detection with Vision Foundation Models
-
LassbergEL Public
Forked from michaelscho/LassbergELEntity recognition and linking pipeline for historical letters.
Python UpdatedNov 21, 2025 -
pageshrink Public
Forked from jahtz/pageshrinkShrink region polygons in PageXML files without intersecting with its content.
Python Apache License 2.0 UpdatedNov 19, 2025 -
open-lovable Public
Forked from firecrawl/open-lovable🔥 Clone and recreate any website as a modern React app in seconds
TypeScript MIT License UpdatedNov 19, 2025 -
TheBigPromptLibrary Public
Forked from 0xeb/TheBigPromptLibraryA collection of prompts, system prompts and LLM instructions
HTML MIT License UpdatedNov 19, 2025 -
LibreTranslate Public
Forked from LibreTranslate/LibreTranslateFree and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
Python GNU Affero General Public License v3.0 UpdatedNov 6, 2025 -
argos-train Public
Forked from argosopentech/argos-trainTraining scripts for Argos Translate
Python MIT License UpdatedNov 3, 2025 -
Churro Public
Forked from stanford-oval/ChurroCHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition
Python Apache License 2.0 UpdatedNov 2, 2025 -
geniza Public
Forked from Princeton-CDH/genizaversion 4.x of the Princeton Geniza Project
Python Apache License 2.0 UpdatedOct 30, 2025 -
strix-halo-testing Public
Forked from lhl/strix-halo-testingJupyter Notebook Apache License 2.0 UpdatedOct 30, 2025 -
bleed-through-cleaner-app Public
Forked from yahyamomtaz/bleed-through-cleaner-appPyTorch-based web application for automatic bleed-through removal in ancient manuscripts
Python UpdatedOct 29, 2025 -
cuc Public
Forked from DT-UCPH/cucContains a text fabric dataset of the Ugaritic corpus.
HCL UpdatedOct 27, 2025 -
hieropy Public
Forked from nederhof/hieropyPython implementation of ancient Egyptian hieroglyphic encoding
Python GNU General Public License v3.0 UpdatedOct 26, 2025 -
pagexml-mets-viewer Public
Forked from CrazyCrud/pagexml-mets-viewerWeb app to upload and display multiple PageXML files
-
page-to-rf-detr-train Public
Convert PAGE-XML to COCO and train a rf-detr seg model
-
-
gddr6-core-junction-vram-temps Public
Forked from ThomasBaruzier/gddr6-core-junction-vram-tempsCore, Junction, and VRAM temperature reader for Linux + GDDR6/GDDR6X GPUs
C Apache License 2.0 UpdatedOct 22, 2025 -
argos-translate Public
Forked from argosopentech/argos-translateOpen-source offline translation library written in Python
Python MIT License UpdatedOct 21, 2025 -
OcrDiffAlign Public
This project provides tools to align noisy OCR output against a reference text
-
bert-phonemizer Public
Forked from thewh1teagle/bert-phonemizerHebrew phonemizer based on Dicta bert with casual attention
Python UpdatedOct 19, 2025 -
RAW_InvisibleEast Public
Forked from OpenITI/RAW_InvisibleEasttexts of the invisible east project
HTML UpdatedOct 17, 2025