feat: support run defined pipeline from config #38

kemingy · 2025-06-30T07:53:04Z

To start a vechord dynamic pipeline service:

import uvicorn

from vechord.registry import VechordRegistry
from vechord.service import create_web_app

if __name__ == "__main__":
    vr = VechordRegistry("run", "postgresql://postgres:[email protected]:5432/")
    app = create_web_app(vr)
    uvicorn.run(app)

Signed-off-by: Keming <[email protected]>

kemingy · 2025-07-01T03:38:36Z

A demo request:

import httpx
import msgspec

ingest = dict(
    name="ragusa",
    data="hello \n world".encode("utf-8"),
    steps=[
        {
            "kind": "chunk",
            "provider": "regex",
            "args": {
                "size": 5,
                "overlap": 0,
            }
        },
        {
            "kind": "embedding",
            "provider": "gemini",
            "args": {
                "api_key": "***"
            }
        },
        {
            "kind": "index",
            "provider": "vectorchord",
            "args": {
                "vector": {
                    "nlist": 1,
                    "distance": "cos",
                }
            }
        }
    ]
)

search = dict(
    name="ragusa",
    data="hi there".encode("utf-8"),
    steps=[
        {
            "kind": "embedding",
            "provider": "gemini",
            "args": {
                "api_key": "***"
            }
        },
        {
            "kind": "search",
            "provider": "vectorchord",
            "args": {
                "vector": {
                    "topk": 1,
                }
            }
        }
    ]
)

for req in (ingest, search):
    resp = httpx.post(
        url="http://localhost:8000/api/run",
        headers={"Content-Type": "application/msgpack"},
        content=msgspec.msgpack.encode(req)
    )
    print(resp)
    print(msgspec.msgpack.decode(resp.content))

cc @xieydd

Copilot

Pull Request Overview

This PR adds support for defining and running dynamic pipelines via a new /api/run endpoint driven by a JSON/msgpack configuration. Key changes include schema updates, pipeline builder implementation, and async extractor adjustments.

Introduce RunRequest and ResourceRequest models for pipeline configuration
Implement run_dynamic_pipeline and build_pipeline in vechord/pipeline.py
Register and expose the new /api/run Falcon endpoint in service.py

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
vechord/service.py	Add `RunResource`, validate and route dynamic pipeline runs
vechord/pipeline.py	New dynamic pipeline builder and executor logic
vechord/model.py	Define `ResourceRequest` and `RunRequest` structures
vechord/rerank.py	Require and inject `COHERE_API_KEY` for async reranking
vechord/registry.py	Update `init_table_index` signature and imports
vechord/extract.py	Convert extractors to async and update Gemini client use
tests/test_table.py	Add a test case for optional keyword in chunks

Comments suppressed due to low confidence (2)

vechord/service.py:135

Set resp.content_type (e.g., falcon.MEDIA_JSON or falcon.MEDIA_MSGPACK) before sending resp.data so clients can correctly interpret the response payload.

            resp.data = encoder.encode(res, enc_hook=vechord_encode_hook)

vechord/service.py:120

[nitpick] Add tests for RunResource and run_dynamic_pipeline to cover both indexing and search pipelines, verifying request validation, error handling, and response encoding.

class RunResource:

vechord/service.py

vechord/pipeline.py

tests/test_table.py

Co-authored-by: Copilot <[email protected]> Signed-off-by: Keming <[email protected]>

Signed-off-by: Keming <[email protected]>

feat: support run defined pipeline from config

130cdab

Signed-off-by: Keming <[email protected]>

kemingy requested a review from Copilot June 30, 2025 07:53

This comment was marked as outdated.

Sign in to view

kemingy added 3 commits June 30, 2025 16:04

args take Any

e65be5d

Signed-off-by: Keming <[email protected]>

run pipeline dyanmically

e6dddde

Signed-off-by: Keming <[email protected]>

await

154797d

Signed-off-by: Keming <[email protected]>

kemingy marked this pull request as ready for review July 1, 2025 03:28

fix search Chunk

1c2e864

Signed-off-by: Keming <[email protected]>

kemingy requested a review from Copilot July 1, 2025 03:38

Copilot AI reviewed Jul 1, 2025

View reviewed changes

vechord/service.py Show resolved Hide resolved

vechord/pipeline.py Show resolved Hide resolved

tests/test_table.py Outdated Show resolved Hide resolved

kemingy and others added 4 commits July 1, 2025 11:40

Update tests/test_table.py

c0c278b

Co-authored-by: Copilot <[email protected]> Signed-off-by: Keming <[email protected]>

fix the run pipeline return type

81061be

Signed-off-by: Keming <[email protected]>

do not return vec/keyword/multivec by default

90c30a9

Signed-off-by: Keming <[email protected]>

add cli for vechord dynamic pipeline service

e0cae0f

Signed-off-by: Keming <[email protected]>

kemingy merged commit f291ad2 into tensorchord:main Jul 1, 2025
4 checks passed

kemingy deleted the run_pipeline branch July 1, 2025 10:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: support run defined pipeline from config #38

feat: support run defined pipeline from config #38

Uh oh!

kemingy commented Jun 30, 2025 •

edited

Loading

Uh oh!

This comment was marked as outdated.

Uh oh!

kemingy commented Jul 1, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: support run defined pipeline from config #38

feat: support run defined pipeline from config #38

Uh oh!

Conversation

kemingy commented Jun 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as outdated.

Uh oh!

kemingy commented Jul 1, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

kemingy commented Jun 30, 2025 •

edited

Loading