Conversation

esnible
Member

@esnible esnible commented Feb 14, 2025

This is a RAG example that uses the correct lifecycle and the Milvus vector database.

Questions. I need feedback before merging.

  • @hirzel Is it OK to remove your old example? Or should we have both?
  • I'm not sure if docs/assets/pdl_quick_reference.pdf is the best example PDF to demo, as it has few sentences.
  • I was not able to get stop_sequences respected. How would a user debug that?
  • This logs many "LiteLLM:WARNING: logging_utils.py:113 - logging_obj not found - unable to track `llm_api_duration_ms`" warnings. I am not sure whether these are my fault.
  • I wanted to see the retrieved sentences. I think Nick's debugger had this, but it was hard to see, so I wrote PDL code to print them. Is there a better way?
  • @starpit In the debugger, the args don't show on function blocks but should, and the def name doesn't show on the call but should. It may be better to show it as a repl rather than as Markdown.

@esnible esnible requested a review from vazirim February 14, 2025 20:02
@hirzel
Member

hirzel commented Feb 14, 2025

Is it OK to remove your old example? Or should we have both?

We could rename the old example to "tfidf_rag" or something like that. And then, we can use the name "rag" for your new example, to indicate that it is now the main official RAG example.

Collaborator

@mandel mandel left a comment

Nice example!

When trying to execute the program, a few dependencies were missing. I had to run:

 pip install pymilvus pypdf 

You could also say in the README which models to pull with Ollama (this example uses mxbai-embed-large and granite-code:8b).

And here are some suggestions on how to rewrite pdf_index.pdl:

# Load PDF document into vector database

description: Load document into vector database
lastOf:
- include: rag_library1.pdl
- defs:
    input_data:
      call: ${ pdf_parse }
      args:
        filename: "docs/assets/pdl_quick_reference.pdf"
        chunk_size: 400
        chunk_overlap: 100  
  call: ${ rag_index }
  args:
    inp: ${ input_data }
    encoder_model: "ollama/mxbai-embed-large"
    embed_dimension: 1024
    database_name: "./pdl-rag-demo.db"
    collection_name: "pdl_rag_collection"
- "Success!"
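For context on the chunk_size: 400 / chunk_overlap: 100 arguments above, here is a minimal stdlib-only sketch of overlapping character chunking. The helper name is illustrative; the actual splitting is done by pdf_parse in rag_library1.pdl and may differ.

```python
def chunk_text(text: str, chunk_size: int = 400, chunk_overlap: int = 100) -> list[str]:
    """Split text into fixed-size character chunks; consecutive chunks
    share chunk_overlap characters so sentences at chunk boundaries
    appear in full in at least one chunk."""
    if chunk_overlap >= chunk_size:
        raise ValueError("chunk_overlap must be smaller than chunk_size")
    step = chunk_size - chunk_overlap
    return [text[i:i + chunk_size]
            for i in range(0, max(len(text) - chunk_overlap, 1), step)]
```

With the defaults above, a 1000-character document yields three chunks starting at offsets 0, 300, and 600, each 400 characters long.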

and pdf_query.pdl:

# Query vector database for relevant passages; use passages to query LLM.

defs:
  QUESTIONS:
    data: [
      "Does PDL have a contribute keyword?",
      "Is Brooklyn the capital of New York?"
    ]
lastOf:
  - include: rag_library1.pdl
  - defs:
      CONCLUSIONS:
        for:
          question: ${ QUESTIONS }
        repeat:
            # Define MATCHING_PASSAGES as the text retrieved from the vector DB
          defs:
            MATCHING_PASSAGES:
              call: ${ rag_retrieve }
              args:
                # I am passing the client in implicitly.  NOT WHAT I WANT
                inp: ${ question }
                encoder_model: "ollama/mxbai-embed-large"
                limit: 3
                collection_name: "pdl_rag_collection"  
                database_name: "./pdl-rag-demo.db"
            # debug:
            #   lang: python
            #   code: |
            #      print(f"MATCHING_PASSAGES='{MATCHING_PASSAGES}'")
            #      result = None
            CONCLUSION:
              model: ollama/granite-code:8b
              input: >
                Here is some information:
                ${ MATCHING_PASSAGES }
                Question: ${ question }
                Answer:
              parameters:
                # I couldn't get this working
                stop_sequences: ['Yes', 'No']
                temperature: 0
            ANSWER:
              lang: python
              code: |
                # split()[0] needed because of stop_sequences not working
                # print(f"CONCLUSION={CONCLUSION}")
                result = CONCLUSION.split()[0]
          data:
            ${question}: ${ANSWER}
        join:
          as: array
    text: "${ CONCLUSIONS | tojson }\n"
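Since stop_sequences was not being respected, the split()[0] workaround in the ANSWER block could be generalized into a small post-hoc truncation helper. A stdlib-only sketch (hypothetical helper, not part of PDL; it keeps the matched stop word itself, since here "Yes"/"No" is the answer):

```python
def truncate_at_stop(text: str, stop_sequences: list[str]) -> str:
    """Cut model output at the earliest occurrence of any stop sequence,
    keeping the stop sequence itself."""
    cut = len(text)
    for stop in stop_sequences:
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx + len(stop))
    return text[:cut].strip()
```

For example, truncate_at_stop("Yes, Brooklyn is not the capital.", ["Yes", "No"]) returns "Yes", and input containing no stop sequence is returned unchanged.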

Member

@vazirim vazirim left a comment

Example looks great! See comments.

Langchain provides different types of retrievers with a unified interface. Do the abstractions here reflect that unified interface?
https://python.langchain.com/docs/concepts/retrievers/
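The unified interface in question amounts to "any retriever takes a query and returns relevant passages." A stdlib-only sketch of such an abstraction (the Retriever protocol, KeywordRetriever, and answer are illustrative names, not Langchain's or PDL's actual API):

```python
from typing import Protocol

class Retriever(Protocol):
    """Any retriever: takes a query string, returns relevant passages."""
    def invoke(self, query: str) -> list[str]: ...

class KeywordRetriever:
    """Toy retriever that returns passages containing any query term."""
    def __init__(self, passages: list[str]):
        self.passages = passages

    def invoke(self, query: str) -> list[str]:
        terms = query.lower().split()
        return [p for p in self.passages
                if any(t in p.lower() for t in terms)]

def answer(retriever: Retriever, question: str) -> list[str]:
    # Callers depend only on the interface, so a vector-database-backed
    # or TF-IDF retriever could be swapped in without changing this code.
    return retriever.invoke(question)
```

If rag_retrieve conformed to an interface like this, the tfidf and Milvus examples could share one query program.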

To make the model stop, use stop instead of stop_sequences.

Could we keep both RAG examples? It would be good for the README to reflect that.

Signed-off-by: Ed Snible <[email protected]>
Member

@vazirim vazirim left a comment

LGTM

@esnible esnible merged commit 8964dd5 into IBM:main Feb 24, 2025
6 checks passed
@esnible esnible deleted the newrag branch February 24, 2025 21:33
jgchn pushed a commit to jgchn/prompt-declaration-language that referenced this pull request Feb 25, 2025
* New RAG example