- New York City
Starred repositories
A curated list of awesome open source workflow engines
Rust GUI components for building fantastic cross-platform desktop applications using GPUI.
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
Extract and convert data from any document, image, PDF, Word doc, PPT, or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
A Python library for reading and writing PDF, powered by QPDF
Go toolkit for clean, composable, channel-based concurrency
Durable Task Framework allows users to write long-running persistent workflows in C# using async/await.
SDG is a specialized framework designed to generate high-quality structured tabular data.
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
ColiVara evaluation on the ViDoRe benchmarks using the NDCG@5 metric.
A powerful AI coding agent. Built for the terminal.
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
A tool that converts OpenAPI specifications to an MCP server
Allow AI to wade through complex OpenAPIs using Simple Language
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
A library to help you make the most out of your Pixoo 64 (and hopefully soon other Wi-Fi enabled Pixoos)
Cached US road graph: freeways, primary and secondary roads