RAG Framework and Fine-tuning Generator

This repository contains scripts for constructing, running, and evaluating Retrieval-Augmented Generation (RAG) frameworks, as well as scripts for fine-tuning the generator framework. The project is designed to facilitate research and development in the area of question answering systems and other NLP applications that can benefit from retrieval-augmented generation models.

Overview

The project includes two main scripts:

run_retrieval_logics.py: Constructs, runs, and evaluates RAG frameworks. The configurations for each RAG framework are defined in YAML files located in the cfgs/ directory.
run_fine_tune_model.py: Fine-tunes the generator framework. This script supports both training and inference modes.

Getting Started

Follow these instructions to set up the project environment and run the scripts for your purposes.

Prerequisites

Before running the scripts, ensure you have Python and the necessary packages installed.

pip install -r requirements.txt

RAG framework

To construct an RAG framework, specify its settings in a yaml file, an example is cfg.yaml.

e.g., execute the following to create 2 frameworks from cfg.yaml and cfg_v2.yaml, execute both and report the performance using ROUGE.

python run_retrieval_logics.py --config_files cfgs/cfg.yaml cfgs/cfg_v2.yaml

Fine-tuning Generator

The file is structured around two main components:

Trainer: A class responsible for setting up and executing the fine-tuning process for a causal language model.
Inferencer: A class designed for loading a fine-tuned model and performing inference to generate text based on input prompts.

To run this Generator, first specify its settings in a yaml file, an example is fine_tune_cfg.yaml.

To start the training process, run the script with the --mode argument set to train:

python run_fine_tune_model.py --mode train --config_file cfgs/fine_tune_cfg.yaml

For performing inference with a trained model, run the script with the --mode argument set to inference and optionally provide a question with --eval_q:

python run_fine_tune_model.py --mode inference --config_file cfgs/fine_tune_cfg.yaml --eval_q "How do I claim expense?"

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
cfgs		cfgs
.env_example		.env_example
.gitignore		.gitignore
README.md		README.md
constants.py		constants.py
rag_logger.py		rag_logger.py
requirements.txt		requirements.txt
run_fine_tune_model.py		run_fine_tune_model.py
run_retrieval_logics.py		run_retrieval_logics.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RAG Framework and Fine-tuning Generator

Overview

Getting Started

Prerequisites

RAG framework

Fine-tuning Generator

About

Uh oh!

Releases

Packages

Uh oh!

Languages

wanmingHuang/RAG

Folders and files

Latest commit

History

Repository files navigation

RAG Framework and Fine-tuning Generator

Overview

Getting Started

Prerequisites

RAG framework

Fine-tuning Generator

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages