A thread-safe vector database for model inference inside LMDB.
- `bincode`/`serde` - serialization and deserialization
- `lmdb-rs` - LMDB database bindings
- `ndarray` - NumPy-equivalent array operations
- `ort`/`onnx` - ONNX Runtime embeddings
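To give a feel for how these crates fit together, here is a minimal sketch (not the valentinus API; the struct, helper names, and values below are made up for illustration) that round-trips an embedding record through `bincode` and compares vectors with `ndarray`:

```rust
use ndarray::Array1;
use serde::{Deserialize, Serialize};

// Hypothetical record: an id plus its embedding vector.
#[derive(Serialize, Deserialize)]
struct EmbeddingRecord {
    id: String,
    vector: Vec<f32>,
}

/// Cosine similarity between two dense vectors.
fn cosine_similarity(a: &Array1<f32>, b: &Array1<f32>) -> f32 {
    let dot = a.dot(b);
    let norm = a.dot(a).sqrt() * b.dot(b).sqrt();
    if norm == 0.0 { 0.0 } else { dot / norm }
}

fn main() {
    // Toy vectors; a real model such as all-MiniLM-L6-v2 produces 384-dimensional embeddings.
    let query = Array1::from(vec![0.1_f32, 0.9, 0.2]);
    let record = EmbeddingRecord {
        id: "doc-1".to_string(),
        vector: vec![0.2, 0.8, 0.1],
    };

    // bincode handles the byte-level serialization of what would be written to LMDB.
    let bytes = bincode::serialize(&record).expect("serialize");
    let restored: EmbeddingRecord = bincode::deserialize(&bytes).expect("deserialize");

    let stored = Array1::from(restored.vector);
    println!("similarity({}) = {}", restored.id, cosine_similarity(&query, &stored));
}
```

Cosine similarity over dense embeddings is the basic comparison a vector database performs; per the description above, valentinus keeps the serialized data inside LMDB.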
NOTE: ensure you have the development packages below installed (e.g. for Fedora):

```bash
sudo dnf install openssl-devel
sudo dnf install gcc-c++
```
```bash
git clone https://github.com/kn0sys/valentinus && cd valentinus
```

The following environment variables are supported:

| var | usage | default |
|---|---|---|
| LMDB_USER | working directory of the user for the database | $USER |
| LMDB_MAP_SIZE | sets the maximum environment size, i.e. the size in memory/disk of all data | 20% of available memory |
| ONNX_PARALLEL_THREADS | parallel execution threads for the ONNX session | 1 |
| VALENTINUS_CUSTOM_DIM | embedding dimensions for custom models | all-MiniLM-L6-v2 -> 384 |
| VALENTINUS_LMDB_ENV | environment for the database (e.g. test, prod) | test |
- Note: all tests currently require the `all-MiniLM-L6-v2_onnx` directory
- Get the `model.onnx` and `tokenizer.json` from Hugging Face or build them:
```bash
mkdir all-MiniLM-L6-v2_onnx \
&& cd all-MiniLM-L6-v2_onnx \
&& wget https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2/resolve/main/config.json \
&& wget https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2/resolve/main/onnx/model.onnx \
&& wget https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2/resolve/main/special_tokens_map.json \
&& wget https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2/resolve/main/tokenizer_config.json \
&& wget https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2/resolve/main/tokenizer.json \
&& wget https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2/resolve/main/vocab.txt
```

Run the tests with:

```bash
cargo test
```
See the examples for usage.
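For a feel of the storage layer before reading the examples, here is a minimal LMDB key/value round trip. It uses the general-purpose `lmdb` crate purely as an illustration; the bindings valentinus actually uses and its public API may differ, and the path and key names here are made up:

```rust
use lmdb::{DatabaseFlags, Environment, Transaction, WriteFlags};
use std::fs;
use std::path::Path;

fn main() -> Result<(), lmdb::Error> {
    // LMDB requires the environment directory to exist before opening.
    let path = Path::new("./lmdb-demo");
    fs::create_dir_all(path).expect("create dir");

    let env = Environment::new()
        .set_map_size(10 * 1024 * 1024) // 10 MiB, analogous to LMDB_MAP_SIZE above
        .open(path)?;
    let db = env.create_db(None, DatabaseFlags::empty())?;

    // Write some bytes (e.g. a serialized embedding record) under a key.
    let mut txn = env.begin_rw_txn()?;
    txn.put(db, &"doc-1", &[1u8, 2, 3], WriteFlags::empty())?;
    txn.commit()?;

    // Read them back in a read-only transaction.
    let txn = env.begin_ro_txn()?;
    let bytes = txn.get(db, &"doc-1")?;
    println!("stored {} bytes", bytes.len());
    Ok(())
}
```

The `set_map_size` call plays the same role as `LMDB_MAP_SIZE` in the table above: it caps how large the memory-mapped environment may grow.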