Lists (19)
Sort Name ascending (A-Z)
Starred repositories
Jupyter notebooks for the Natural Language Processing with Transformers book
Supercharge Your LLM Application Evaluations 🚀
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Get your documents ready for gen AI
A lightweight LMM-based Document Parsing Model
Multilingual Document Layout Parsing in a Single Vision-Language Model
A collection of benchmarks and datasets for evaluating LLM.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Tesseract Open Source OCR Engine (main repository)
Implementation of Nougat Neural Optical Understanding for Academic Documents
This repository contains demos I made with the Transformers library by HuggingFace.
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Lemonade helps users run local LLMs with the highest performance by configuring state-of-the-art inference engines for their NPUs and GPUs. Join our discord: https://discord.gg/5xXzkMu8Zk
A curated list of 120+ LLM libraries category wise.
A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
Code and pre-trained models for our paper "CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection".
[ICCV 2023] Official implementation of the paper: "DIRE for Diffusion-Generated Image Detection"
Code for the paper: CNN-generated images are surprisingly easy to spot... for now https://peterwang512.github.io/CNNDetection/
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
MTEB: Massive Text Embedding Benchmark
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, con…