A minimalist, high-performance implementation of RWKV (Receptance Weighted Key Value) models using Candle, a lightweight ML framework for Rust.
We support the latest and greatest from the RWKV family:
- ✅ RWKV7 (Goose)
- ✅ RWKV6 (Finch)
- ✅ RWKV5 (Eagle)
Ready to run? The commands below will get you started immediately. Run inference directly from the command line:
```bash
# Run RWKV7 (Goose)
cargo run --release --example rwkv -- --which "v7-0b1" --prompt "User: why is the sky blue?\n\nAssistant: "

# Run RWKV6 (Finch)
cargo run --release --example rwkv -- --which "v6-1b6" --prompt "User: Hello, how are you?\n\nAssistant: "
```

Running on a laptop? Use quantization to save memory.
```bash
# Run Quantized RWKV7 (Goose)
cargo run --release --example rwkv -- --quantized --which "v7-0b1" --prompt "User: Tell me a joke.\n\nAssistant: "
```

If you prefer managing your own model files (e.g. a .pth checkpoint downloaded from HuggingFace), we provide tools to convert and run them.
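If you'd rather script the download than fetch the checkpoint by hand, a sketch along these lines could work using the `hf-hub` crate; the repo id `BlinkDL/rwkv-6-world` and the `anyhow` error handling are assumptions of this sketch, not something this repo pins:

```rust
// Hypothetical sketch: fetch a .pth checkpoint from the HuggingFace Hub
// with the `hf-hub` crate. The repo id and filename are illustrative.
use hf_hub::api::sync::Api;

fn main() -> anyhow::Result<()> {
    let api = Api::new()?;
    let repo = api.model("BlinkDL/rwkv-6-world".to_string());
    // Downloads into the local HF cache (if not already present)
    // and returns the on-disk path to the checkpoint.
    let path = repo.get("RWKV-x060-World-1B6-v2.1-20240328-ctx4096.pth")?;
    println!("checkpoint at {}", path.display());
    Ok(())
}
```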
First, convert PyTorch weights (.pth) to SafeTensors for efficient loading in Rust.
```bash
# Convert Model Weights
cargo run --release --example convert -- --input ./RWKV-x060-World-1B6-v2.1-20240328-ctx4096.pth

# Convert State Files
cargo run --release --example convert -- --input ./rwkv-x060-chn_single_round_qa-1B6-20240516-ctx2048.pth
```

Then point the `rwkv` example at the converted files:

```bash
# Run with local converted files
cargo run --release --example rwkv -- \
    --which "v6-1b6" \
    --weight-files ./RWKV-x060-World-1B6-v2.1-20240328-ctx4096.safetensors \
    --state-file ./rwkv-x060-chn_single_round_qa-1B6-20240516-ctx2048.safetensors \
    --prompt "Hello world!"
```
Alternatively, convert .pth files to the standardized GGUF format and run them quantized.

```bash
# Quantize .pth to .gguf
cargo run --release --example quantize -- --input ./RWKV-x060-World-1B6-v2.1-20240328-ctx4096.pth

# Run with local GGUF file
cargo run --release --example rwkv -- \
    --quantized \
    --which "v6-1b6" \
    --weight-files ./RWKV-x060-World-1B6-v2.1-20240328-ctx4096-q4k.gguf \
    --prompt "User: Hello!\n\nAssistant: "
```
Contributions are more than welcome! Feel free to open issues or submit PRs.

Powered by [candle](https://github.com/huggingface/candle)