AI Agent for Global Spare Parts business area. RAG Web App developed in order to work as each collaborator's Virtual Assistant.

paulinhok14/rag-chatbot


H.O.L.M.E.S.

[H]istorical [O]bservation and [L]earning [M]aterials [E]ngineering [S]ystem

Version 1.0

holmes logo


H.O.L.M.E.S. was developed to address a challenge common to many business areas of a company: having a single "source of truth" where different kinds of information, technical or otherwise, can be consulted.

Built on the principles of a RAG (Retrieval-Augmented Generation) architecture, which combines generative neural network models with documents that provide specific context, the system is capable of:

1- Communicating with the user in natural language

2- Answering questions specific to a business area, provided the system has been given enough data sources in which to search for the desired information.

How H.O.L.M.E.S. Works

The user writes a question about a specific subject in the area in which they work: internal regulations, acronyms, material information, purchasing documents, stock, orders, and so on.

The system identifies the subject of the question, performs a similarity search across all knowledge bases provided to it (structured and unstructured), retrieves the information relevant to the question, and generates a response based on the user's question and that retrieved context.
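The retrieve-then-generate flow described above can be sketched in a few lines. This is a toy illustration using only the Python standard library, not the project's actual code: word counts stand in for a real embedding model, a sorted list stands in for a vector store, and the knowledge base entries, `embed`, `retrieve`, and `build_prompt` are all hypothetical names invented for this example.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts (stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question: str, documents: list[str], k: int = 2) -> list[str]:
    """Return the k documents most similar to the question."""
    query = embed(question)
    ranked = sorted(documents, key=lambda d: cosine(query, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(question: str, context: list[str]) -> str:
    """Combine the retrieved context and the question into one prompt for the LLM."""
    joined = "\n".join(f"- {chunk}" for chunk in context)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{joined}\n\n"
        f"Question: {question}"
    )


# Hypothetical knowledge base entries, invented for illustration only.
KNOWLEDGE_BASE = [
    "AOG stands for Aircraft on Ground.",
    "Purchase orders above the approval limit require a second signature.",
    "Stock levels are reviewed every Monday by the planning team.",
]

if __name__ == "__main__":
    question = "What does AOG stand for?"
    print(build_prompt(question, retrieve(question, KNOWLEDGE_BASE, k=1)))
```

In a real deployment the prompt would be sent to the chat model, and the word-count vectors would be replaced by dense embeddings stored in a vector index.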

Don't worry about the technical jargon. The system architecture below shows how the flow described above works.

rag arch

A challenge that naturally arises when developing an application like this is choosing the models that will make up the system architecture.

Data Governance matters here (where the knowledge base is stored and processed), and so do the costs of keeping a system with such value-generating capacity in operation.

Three components of the RAG architecture are the focus of this concern: the LLM/Chat model, the Embeddings model, and the Vector Store.

rag arch challenges

Initially, free open-source LLMs running locally were chosen, together with a vector storage library (Vector Store) that saves the transformed data locally.

Host: Ollama (Local)
LLM: llama3.2 w/ 2B parameters (Meta)
Embeddings: nomic-embed-text
Vector Store: FAISS - Facebook AI Similarity Search (Meta)
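One way to keep these four choices in a single place is a small configuration object. The sketch below is an assumption about how such a configuration could look, not the project's actual code: `RagStackConfig`, its field names, and the `"openai"` host value are hypothetical, while the default values mirror the list above.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class RagStackConfig:
    """Hypothetical container for the component choices of the RAG stack."""

    host: str = "ollama"                  # where the models run (local)
    llm: str = "llama3.2"                 # chat model
    embeddings: str = "nomic-embed-text"  # embedding model
    vector_store: str = "faiss"           # similarity search library


# The initial, fully local stack described above.
LOCAL_STACK = RagStackConfig()

# Hypothetical upgrade: only the host and chat model change,
# while the embeddings and vector store stay as they are.
HOSTED_STACK = replace(LOCAL_STACK, host="openai", llm="gpt-4")

if __name__ == "__main__":
    print(LOCAL_STACK)
    print(HOSTED_STACK)
```

Freezing the dataclass makes each stack an immutable value, so an upgrade is expressed as a new configuration rather than a mutation of the running one.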

The system has considerable room to scale because its structure is simple to change and improve. More robust models (such as GPT-4, Gemini, etc.) can be adopted for an investment that may be small compared to the potential return. Additional knowledge base formats can be supported: PDF, Word, SharePoint, Excel spreadsheets, XML, TXT documents, and databases. The system also benefits directly from the natural evolution of open-source LLM/Chat models.

All of this is achieved while keeping a cohesive structure (LangChain) tied to the system's main objectives.
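Supporting a new knowledge base format can be reduced to registering one more loader. The following is a minimal standard-library sketch of that idea, not LangChain's loader API and not the project's code: `LOADERS`, `register`, `load_text`, and `load_document` are hypothetical names, and only plain-text formats are handled here.

```python
from pathlib import Path
from typing import Callable, Dict

Loader = Callable[[Path], str]

# Maps a file extension to the function that turns that file into text.
LOADERS: Dict[str, Loader] = {}


def register(*extensions: str) -> Callable[[Loader], Loader]:
    """Decorator that registers a loader for one or more file extensions."""
    def decorator(func: Loader) -> Loader:
        for ext in extensions:
            LOADERS[ext.lower()] = func
        return func
    return decorator


@register(".txt", ".xml")
def load_text(path: Path) -> str:
    """Read the file as UTF-8 text (XML is returned as raw markup in this sketch)."""
    return path.read_text(encoding="utf-8")


def load_document(path: Path) -> str:
    """Dispatch to the loader registered for the file's extension."""
    loader = LOADERS.get(path.suffix.lower())
    if loader is None:
        raise ValueError(f"No loader registered for '{path.suffix}' files")
    return loader(path)
```

Adding PDF or Word support would then mean writing one function and decorating it with `@register(".pdf")` or `@register(".docx")`, without touching the rest of the pipeline.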

The user interface is a web application: once the address is accessed, the server responds with the H.O.L.M.E.S. main page, the Chat page.

See H.O.L.M.E.S. in action below!

holmes-in-action.mp4

The About page, accessed via the sidebar, describes how the system works and how it is structured.

holmes about page

The next steps are to evaluate the feasibility of adding new documents as information sources for the knowledge base, and of improving the system's quality by upgrading the LLM/Chat and Embeddings models to higher-quality ones, which are usually paid.

Stack:


© Paulo Roberto de Sá Araújo, 2024
