Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View gto7's full-sized avatar

Block or report gto7

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Ollama Python library

Python 8,739 841 Updated Oct 7, 2025

Examples and tutorials to help developers build AI systems

Python 3,323 1,131 Updated Oct 8, 2025

Python library for Agentic Document Extraction from LandingAI

Python 2,123 223 Updated Oct 22, 2025

An introduction to PyTest with lots of simple, hackable examples

Python 396 159 Updated Nov 27, 2023

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 8,974 815 Updated Jul 20, 2025

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…

Python 23,140 2,442 Updated Oct 23, 2025

Production-ready platform for agentic workflow development.

TypeScript 117,124 18,095 Updated Oct 23, 2025

borb is a library for reading, creating and manipulating PDF files in python.

Python 3,537 150 Updated Oct 20, 2025

Python bindings to PDFium, reasonably cross-platform.

Python 657 33 Updated Oct 22, 2025

Python SDK for Milvus Vector Database

Python 1,287 380 Updated Oct 23, 2025

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 7,077 681 Updated Jul 10, 2025

Simple package to extract text with coordinates from programmatic PDFs

C++ 207 46 Updated Oct 20, 2025

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 26,710 1,864 Updated Oct 23, 2025

qpdf: A content-preserving PDF document transformer

C++ 4,411 334 Updated Oct 22, 2025

Benchmarking PDF libraries

Python 314 20 Updated Jul 2, 2025

Get your documents ready for gen AI

Python 42,113 3,013 Updated Oct 23, 2025

An Improved Langchain RAG Tutorial (v2) with local LLMs, database updates, and testing.

Python 901 582 Updated Aug 3, 2024

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 38,064 3,479 Updated Oct 23, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 29,373 1,957 Updated Oct 21, 2025

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Python 2,747 256 Updated Mar 25, 2025
Python 43 3 Updated Jul 9, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 151,518 30,916 Updated Oct 23, 2025

Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation

Python 203 28 Updated May 23, 2025

Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition using Pytorch

Python 294 76 Updated Feb 9, 2023

End to end solution for migrating CSV data into a Neo4j graph using an LLM for the data discovery and graph data modeling stages.

Python 139 21 Updated Dec 6, 2024

GraphRAG: Knowledge in Graphs not Documents

Python 16 2 Updated Jul 5, 2025

🌄 Open Source AI & Data Landscape - provides overview of top tier projects in the open source AI and Data ecosystem, shows projects through GitHub data, funding or market cap, first and last commit…

369 118 Updated Oct 23, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 28,795 3,009 Updated Oct 23, 2025

OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR

Python 158 14 Updated Sep 8, 2025
Next