Codestin Search App

dsikka · 2023-09-11T19:45:38Z

Summary:

Includes two other PRs: [server] Update OpenAI Model Support #1300 and [server] Refactor + OpenAI Chat Completion Support #1288
Updates the base routes such that they follow the mlserver convention

Testing

Updated all the server tests
Tested locally using custom routes as well as routes built for the user given the model name

Sample Config:

num_cores: 2
num_workers: 2
endpoints:
  - task: question_answering
    model: zoo:nlp/question_answering/bert-base/pytorch/huggingface/squad/12layer_pruned80_quant-none-vnni
    route: some_route/pruned

This will now produce the following endpoints:

bfineran

LGTM overall - as discussed offline, agree with both @dsikka and @Satrat it's time we move route creation to a separate file / fleshed out system

dbogunowicz

LGTM, clean

bfineran

LGTM - as mentioned before we need to sync w/ QA before landing

mgoin

This seems like it would cause breaking changes to any application built using the old endpoint structure. I think if you could make a README or document showing how to transition current examples of usage, that would be helpful for other teams and users to have a summary of the changes.
For instance, how should we update this Digital Ocean getting started guide? https://marketplace.digitalocean.com/apps/deepsparse-inference-runtime

* refactor server for different integrations; additional functionality for chat completion streaming and non streaming * further refactor server * add support such that openai can host multiple models * update all tests * fix output for n > 1 * add inline comment explaining ProxyPipeline * [server] Update OpenAI Model Support (#1300) * update server * allow users to send requests with new models * use v1; move around baseroutes * add openai path * PR comments * clean-up output classes to be dataclasses, add docstrings, cleanup generation kwargs

…parse into match_mlserver

dsikka added 2 commits September 11, 2023 15:41

update/clean-up server to match mlserver docs

141a6b1

update server tests

ff03ad6

dsikka marked this pull request as ready for review September 11, 2023 21:10

dsikka requested review from Satrat, bfineran, dbogunowicz and rahul-tuli September 12, 2023 14:06

Merge branch 'main' into match_mlserver

36e5649

Satrat reviewed Sep 12, 2023

View reviewed changes

Comment thread src/deepsparse/server/server.py Outdated

Comment thread src/deepsparse/server/server.py Outdated

bfineran reviewed Sep 12, 2023

View reviewed changes

Comment thread src/deepsparse/server/server.py

dbogunowicz previously approved these changes Sep 13, 2023

View reviewed changes

dsikka added 3 commits September 13, 2023 10:44

Merge branch 'main' into match_mlserver

a09fe4a

Merge branch 'main' into match_mlserver

a774639

add back ping

750e422

dsikka dismissed dbogunowicz’s stale review via 750e422 September 26, 2023 20:51

dsikka requested review from Satrat, bfineran and dbogunowicz September 26, 2023 21:00

Merge branch 'main' into match_mlserver

6cffa85

bfineran previously approved these changes Oct 6, 2023

View reviewed changes

mgoin reviewed Oct 10, 2023

View reviewed changes

dsikka dismissed bfineran’s stale review via d99a82c October 10, 2023 13:50

Merge branch 'main' into match_mlserver

0afcd7e

Satrat reviewed Oct 10, 2023

View reviewed changes

Comment thread src/deepsparse/server/cli.py Outdated

Comment thread src/deepsparse/server/openai_server.py

Comment thread src/deepsparse/server/output.py

dsikka added 2 commits October 10, 2023 11:56

update readme, update route cleaning, update docstring

f2327d3

Merge branch 'main' into match_mlserver

c13832b

dsikka requested review from Satrat, bfineran and mgoin October 10, 2023 15:57

bfineran previously approved these changes Oct 10, 2023

View reviewed changes

Merge branch 'main' into match_mlserver

4f7e697

Satrat previously approved these changes Oct 10, 2023

View reviewed changes

dsikka added 2 commits October 10, 2023 15:10

fix README for QA

18913d4

Merge branch 'match_mlserver' of https://github.com/neuralmagic/deeps…

c550799

…parse into match_mlserver

dsikka dismissed stale reviews from Satrat and bfineran via c550799 October 10, 2023 19:11

Merge branch 'main' into match_mlserver

d40fce7

Satrat approved these changes Oct 10, 2023

View reviewed changes

dsikka requested a review from bfineran October 10, 2023 19:36

Merge branch 'main' into match_mlserver

1283671

bfineran approved these changes Oct 11, 2023

View reviewed changes

dsikka merged commit 639e9e4 into main Oct 11, 2023

dsikka deleted the match_mlserver branch October 11, 2023 12:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[server] Update server routes to be compliant with MLServer#1237

[server] Update server routes to be compliant with MLServer#1237
dsikka merged 16 commits into
mainfrom
match_mlserver

dsikka commented Sep 11, 2023 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

bfineran left a comment

Uh oh!

Uh oh!

dbogunowicz left a comment

Uh oh!

bfineran left a comment

Uh oh!

mgoin left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

dsikka commented Sep 11, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary:

Testing

Uh oh!

Uh oh!

Uh oh!

bfineran left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dbogunowicz left a comment

Choose a reason for hiding this comment

Uh oh!

bfineran left a comment

Choose a reason for hiding this comment

Uh oh!

mgoin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

dsikka commented Sep 11, 2023 •

edited

Loading