-
Notifications
You must be signed in to change notification settings - Fork 118
[wip]Develop/embedding grpc server #315
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
db3214a to
223cca9
Compare
|
|
||
| grpc::Status health(grpc::ServerContext* context, const EmbeddingHealthRequestPB* request, EmptyPB* writer); | ||
|
|
||
| private: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
engine 里面的其他接口打算实现吗?worker_status? cache_status
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
意思应该是local rpc server里实现worker_status cache_status,替代ops吧,是要的
| py::object custom_module = model.attr("custom_module"); | ||
| py::object py_render = model.attr("custom_module").attr("renderer"); | ||
| py::object py_tokenizer = model.attr("tokenizer"); | ||
| py::object py_handler = model.attr("custom_module").attr("handler"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
后处理还没处理
| return await self.embedding(request, raw_request) | ||
|
|
||
| def _handle_exception(self, request: Dict[str, Any], e: BaseException): | ||
| exception_json = format_exception(e) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
还需要吗
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
_handle_exception这种函数等后面集中处理一下
57743c4 to
6811469
Compare
施工中