idalin6127

Pinned

  1. Module7-Synthetic-Data-Generation-Fine-Tuning-QLoRA-Assignment Public

    Synthetic Data Generation & Fine-Tuning (QLoRA) Assignment

    Python

  2. Module6-About-Voice-Agent-with-Function-Calling Public

    Voice Agent with Function Calling

    Python

  3. Module5-Hybrid-Retriever-SFT-FAISS-FTS5-BM25-LoRA-QLoRA Public

    Week 5 project: build a hybrid retriever that fuses FAISS dense vectors with SQLite FTS5/BM25 keyword search (RRF/weighted fusion), plus a Supervised Fine-Tuning (SFT) pipeline (Full FT vs LoRA/QLo…

    Python

  4. Module4-AG-Pipeline-over-Personal-Docs-FAISS-Chroma-LangChain-QA-OpenAI-vLLM Public

    Week 4 project: build a Retrieval-Augmented Generation (RAG) pipeline over your own docs (resume/portfolio). Includes embedding, vector DB (FAISS/Chroma), retrieval, LangChain QA, and evaluation; s…

    Python

  5. Module3-Mini-Pretraining-Data-Local-Voice-Assistant-OCR-Web-ASR-LLM-TTS Public

    Week 3 project combining a mini pretraining data pipeline (web scraping, OCR, cleaning, deduplication) and a local real-time voice assistant (ASR, LLM, TTS).

    Jupyter Notebook

  6. Module2_v1-Data-Collection-Extraction-OCR-Web-ASR-Cleaning-Dedup Public

    Hands-on project for data collection & extraction (Week 2). Implements OCR (Tesseract), web scraping (arXiv), PDF text extraction, automatic speech recognition (Whisper), and dataset cleaning/dedup…

    Python