Feat: adding rpc_servers parameter to Llama class #1477

Merged: 8 commits into abetlen:main on Jun 4, 2024

Conversation

@chraac (Contributor) commented May 23, 2024

This PR includes the following changes:

  • Add rpc_servers to Llama, to pass the rpc_servers value through to the llama.cpp lib
  • In the Makefile, add an item to enable the LLAMA_RPC flag
  • Update the llama.cpp submodule reference to the latest version

Tested on my machine, works as expected.

Closes #1455
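
For illustration, a minimal sketch of how the new parameter might be used (the model path and server addresses are placeholders, and the comma-separated host:port format is an assumption based on llama.cpp's --rpc option):

    from llama_cpp import Llama

    # Sketch only: the path and addresses below are placeholders.
    llm = Llama(
        model_path="./models/model.gguf",
        rpc_servers="192.168.1.10:50052,192.168.1.11:50052",  # assumed comma-separated host:port list
        n_gpu_layers=-1,  # offloaded layers are distributed across the RPC servers
    )

    out = llm("Q: What is the capital of France? A:", max_tokens=16)
    print(out["choices"][0]["text"])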

@chraac changed the title to Feat: adding rpc_servers parameter to Llama class on May 23, 2024
@chraac force-pushed the dev-add-rpc branch 3 times, most recently from f86d077 to 00a34ea on May 25, 2024
@abetlen (Owner) commented May 27, 2024

@chraac great work! Could you also include the build flags in the README as well as maybe a small section on running the rpc servers?

@chraac (Contributor, Author) commented May 27, 2024

@chraac great work! Could you also include the build flags in the README as well as maybe a small section on running the rpc servers?

yeah, sure, will add to readme, thanks for the reply!

@chraac force-pushed the dev-add-rpc branch 2 times, most recently from 12bac9b to 7854795 on May 29, 2024
@chraac (Contributor, Author) commented May 29, 2024

@abetlen Added a section in README.md about how to build the RPC backend.

@chraac force-pushed the dev-add-rpc branch 3 times, most recently from d0a79b8 to 40e3247 on June 4, 2024
@chraac (Contributor, Author) commented Jun 4, 2024

@abetlen, could you have another look when convenient, please? I have added a section about how to build the RPC backend package.

@abetlen merged commit d634efc into abetlen:main on Jun 4, 2024; 16 checks passed.
@chraac deleted the dev-add-rpc branch on June 4, 2024 at 14:38.

@juanjfrancisco commented:

Hi, I apologize for asking this here, but I'm trying to understand how to use the rpc_servers functionality to enable inference across multiple machines. Can anyone help clarify this for me or give me an example? Thanks in advance.

@statchamber commented:

How do I use this...?

@chraac (Contributor, Author) commented Aug 30, 2024

Hi @juanjfrancisco @statchamber, sorry for the inconvenience. Currently the GGML_RPC flag is disabled by default in the upstream repo, so you might need to install the Python package from source (a minimal end-to-end sketch follows the steps below):

  1. Clone the repo: git clone https://github.com/abetlen/llama-cpp-python.git
  2. For the RPC client, which is the endpoint that makes the RPC requests, install it into your Python environment with:
    CMAKE_ARGS='-DGGML_RPC=on -DGGML_OPENMP=on' python3 -m pip install --verbose -e .
  3. For the RPC server, which actually executes the RPC requests, build it with:
    cd vendor/llama.cpp
    mkdir -p build_rpc_client
    pushd build_rpc_client
    cmake -DGGML_RPC=ON -DGGML_CUDA=ON .. && cmake --build . --config Release
    popd
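
As a rough end-to-end sketch (hostnames, the port, and paths are placeholders; the rpc-server binary location and its -p flag follow llama.cpp's RPC example and may differ for your build):

    # On each worker machine, start the RPC server built in step 3.
    # Binary location and flags may vary depending on your build:
    #   ./vendor/llama.cpp/build_rpc_client/bin/rpc-server -p 50052
    #
    # On the client machine (where the package from step 2 is installed),
    # point rpc_servers at the running workers:
    from llama_cpp import Llama

    llm = Llama(
        model_path="./models/model.gguf",           # placeholder model path
        rpc_servers="worker1:50052,worker2:50052",  # host:port of the running rpc-servers
        n_gpu_layers=-1,                            # offloaded layers are spread across the servers
    )
    print(llm("Hello", max_tokens=8)["choices"][0]["text"])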
    
