This example demonstrates how to build a Retrieval-Augmented Generation (RAG) pipeline to search and answer questions over websites using firecrawl and moorcheh.
- Uses firecrawl for web scraping
- Uses Moorcheh's API for embedding, storage, and retrieval
- Clean ingestion and simple pipeline
moorchehwebsite.ipynb– Notebook version of the pipelinemoorchehwebsite.py– Python Script versionqueries.txt– Example legal queriesREADME.md– This file
- Python 3.9+
- Moorcheh API Key (get one at moorcheh.ai)
- Firecrawl API Key
You can either run the pipeline as a Jupyter notebook or a Python script:
Notebook:
jupyter notebook moorchehwebsite.py