🦛 CHONK docs with Chonkie ✨ — The no-nonsense RAG library
- 
            Updated
            Oct 21, 2025 
- Python
🦛 CHONK docs with Chonkie ✨ — The no-nonsense RAG library
ChatGPT PROMPTs Splitter. Tool for safely process chunks of up to 15,000 characters per request
Fully neural approach for text chunking
🦛 CHONK your texts with Chonkie ✨ Type-friendly, light-weight, fast and super-simple chunking library
🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows
A sentence splitting (sentence boundary disambiguation) library for Go. It is rule-based and works out-of-the-box.
An agent with human in the loop that can search the web for information while bypassing bot detection for private sites.
We compared LangChain, Fixie, and Marvin
In this we implements a Retrieval-Augmented Generation (RAG) based conversational AI agent designed for intelligent knowledge extraction from PDF documents. Leveraging LangChain and Google’s Gemini LLM
JChunk is a lightweight and flexible library designed to provide multiple strategies for text chunking within Java applications
Generative AI projetc using LangChain for similarity search. Input 3 articles urls and ask something about the topic
An exploration of text splitting and chunking in JavaScript
A lightweight TypeScript text splitter for RAG applications
Free AI Prompt Splitter - Split large documents into chunks for ChatGPT, Claude, GPT-4. Supports PDF, TXT, MD files. Smart token counting & overlap control.
Leveraging Langchain for a RAG (Retriever Augmented Generation) project, this implementation enables efficient querying across multiple books, enhancing data retrieval and natural language generation for context-rich answers.
This repository covers all the code materials covered within Jose Portilla's Langchain with Python Bootcamp on Udemy.
A smart C# text splitting library that intelligently chunks text while preserving semantic boundaries. Uses a hierarchical approach with configurable overlap and detailed metadata.
Allows you to upload to GitHub text files over 100MB
Kardenwort is an intelligent tools for language learners that transforms complex texts and words into simple, clear, and context-rich vocabulary lists
Add a description, image, and links to the text-splitter topic page so that developers can more easily learn about it.
To associate your repository with the text-splitter topic, visit your repo's landing page and select "manage topics."