BPlusTree

High-performance B+ tree implementations for Rust and Python, designed for efficient range queries and sequential access patterns.

🚀 Dual-Language Implementation

This project provides complete, optimized B+ tree implementations in both languages:

🦀 Rust Implementation - Zero-cost abstractions, arena-based memory management
🐍 Python Implementation - Competitive with SortedDict (from sortedcontainers), optimized for range-scan workloads

📊 Performance Highlights

Rust Implementation

32-68% faster range scans than std::collections::BTreeMap (1.5-2.8x throughput)
23-68% faster GET operations across all dataset sizes
2-22% faster insertions with excellent scaling
Trade-off: 34% slower deletes in optimized scenarios

Python Implementation

Up to 2.5x faster than SortedDict for partial range scans
1.4x faster for medium range queries
Excellent scaling for large dataset iteration

🎯 Choose Your Implementation

Use Case	Rust	Python
Systems programming	✅ Primary choice	❌
High-performance applications	✅ Zero-cost abstractions	⚠️ Good for specific patterns
Database engines	✅ Full control	⚠️ Limited
Data analytics	✅ Fast	✅ Great for range queries
Rapid prototyping	⚠️ Learning curve	✅ Easy integration
Existing Python codebase	❌	✅ Drop-in replacement

🚀 Quick Start

Rust

use bplustree::BPlusTreeMap;

let mut tree = BPlusTreeMap::new(16).unwrap();
tree.insert(1, "one");
tree.insert(2, "two");

// Range queries with Rust syntax!
for (key, value) in tree.range(1..=2) {
    println!("{}: {}", key, value);
}

Python

from bplustree import BPlusTree

tree = BPlusTree(capacity=128)
tree[1] = "one"
tree[2] = "two"

# Range queries
for key, value in tree.range(1, 2):
    print(f"{key}: {value}")

📖 Documentation

📚 Technical Documentation - Architecture, algorithms, benchmarks
🦀 Rust Documentation - Rust-specific usage and examples
🐍 Python Documentation - Python-specific usage and examples

Performance Characteristics

BPlusTreeMap shows significant performance advantages over Rust's standard BTreeMap in range operations and read-heavy workloads. In benchmarks across dataset sizes from 1K to 10M entries, BPlusTreeMap completes range scans 32-68% faster, delivering 1.5-2.8x higher throughput (67K-212K vs 44K-83K items/ms). GET operations are 23-68% faster at every scale tested, which makes BPlusTreeMap well suited to read-heavy applications and analytical workloads.
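The throughput figures above are in items per millisecond. As a rough illustration of how such a number could be measured, here is a minimal Python sketch. It times a range scan over a plain sorted list, which is only a stand-in and not either tree; the function name `range_scan_throughput` and its parameters are hypothetical, and the project's own benchmarks (run with `cargo bench`) remain the source of the published numbers.

```python
import time
from bisect import bisect_left, bisect_right
from typing import List, Tuple


def range_scan_throughput(keys: List[int], lo: int, hi: int,
                          repeats: int = 5) -> Tuple[int, float]:
    """Scan lo <= key <= hi in sorted `keys`; return (items scanned, best items/ms)."""
    best = 0.0
    scanned = 0
    for _ in range(repeats):
        start = time.perf_counter()
        # Locate the window once, then visit every item in it.
        window = keys[bisect_left(keys, lo):bisect_right(keys, hi)]
        checksum = 0
        for key in window:
            checksum += key  # touch each item so the scan is not skipped
        elapsed_ms = (time.perf_counter() - start) * 1000.0
        scanned = len(window)
        if elapsed_ms > 0:
            best = max(best, scanned / elapsed_ms)
    return scanned, best


scanned, items_per_ms = range_scan_throughput(list(range(100_000)), 1_000, 50_999)
print(f"scanned {scanned} items at roughly {items_per_ms:.0f} items/ms")
```

Taking the best of several repeats reduces noise from the first, cold run; absolute numbers from this sketch are not comparable with the Rust figures.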

Insert performance ranges from competitive to better: BPlusTreeMap inserts 2-22% faster depending on dataset size and configuration. The implementation scales well, with the most pronounced advantages on datasets above 1M entries. Deletes are the primary trade-off: BPlusTreeMap is 34% slower in optimized scenarios and anywhere from 1.7x to 10.5x slower depending on capacity configuration, with the worst results at high capacities (1024+ elements per node).

Capacity configuration is critical for optimal performance. The B+ tree implementation allows tuning of node capacity, with optimal settings varying by use case: capacity 64-128 for datasets under 10K entries, 128-256 for medium datasets (10K-100K), and 256-512 for large datasets (100K-1M+). Proper configuration can achieve near-optimal performance across all operations, while misconfiguration (particularly high capacities with delete-heavy workloads) can significantly impact performance.
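The capacity guidance above can be written down as a small lookup. This is a sketch only: `suggested_capacity_range` is a hypothetical helper, not part of either implementation, and how it treats the exact boundaries (10K and 100K) is an assumption, since the quoted ranges overlap there.

```python
from typing import Tuple


def suggested_capacity_range(expected_entries: int) -> Tuple[int, int]:
    """Map an expected dataset size to the capacity range quoted above."""
    if expected_entries < 0:
        raise ValueError("expected_entries must be non-negative")
    if expected_entries < 10_000:
        return (64, 128)      # small datasets
    if expected_entries <= 100_000:
        return (128, 256)     # medium datasets; boundary handling is assumed
    # The text gives 256-512 for "100K-1M+"; sizes far beyond 1M are
    # extrapolated here and should be benchmarked.
    return (256, 512)


low, high = suggested_capacity_range(50_000)
print(f"try capacities between {low} and {high}")
```

The returned range is a starting point for benchmarking, not a guarantee; delete-heavy workloads may favour the lower end.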

BPlusTreeMap is recommended for range-heavy workloads (>20% range scans), read-heavy applications (>60% gets), large dataset analytics, and mixed workloads with light-to-moderate delete operations (<15% deletes). Standard BTreeMap remains preferable for delete-heavy workloads, small datasets with unknown access patterns, or applications requiring zero configuration. The performance characteristics make BPlusTreeMap particularly valuable for database-like applications, time-series analysis, and any scenario where range queries and sequential access patterns dominate.
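The thresholds in this recommendation can be turned into a rough decision helper. The sketch below is illustrative: `recommend_map` is a hypothetical function, and letting the delete threshold take precedence over the range and read thresholds is an assumption, because the text does not say how to rank them when they conflict.

```python
def recommend_map(range_scans: float, gets: float, deletes: float) -> str:
    """Each argument is that operation's share of all operations, from 0 to 1."""
    for name, share in (("range_scans", range_scans),
                        ("gets", gets),
                        ("deletes", deletes)):
        if not 0.0 <= share <= 1.0:
            raise ValueError(f"{name} must be a fraction between 0 and 1")
    if deletes >= 0.15:
        return "BTreeMap"        # delete-heavy: the text favours the standard map
    if range_scans > 0.20 or gets > 0.60:
        return "BPlusTreeMap"    # range-heavy or read-heavy with light deletes
    return "benchmark both"      # the text gives no clear rule for this mix


print(recommend_map(range_scans=0.30, gets=0.50, deletes=0.05))
```

Workloads that fall between the stated thresholds are best settled by measuring both structures on representative data.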

🏗️ Architecture

Both implementations share core design principles:

Arena-based memory management for efficiency
Linked leaf nodes for fast sequential access
Hybrid navigation combining tree traversal + linked list iteration
Optimized rebalancing with reduced duplicate lookups
Comprehensive testing including adversarial test patterns
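To make the first three principles concrete, here is a minimal Python sketch of leaves kept in an arena (a list addressed by index) and linked left to right, with a range scan that descends once and then follows sibling links. It is not the project's code: `Leaf`, `LeafChain`, the sorted `separators` list standing in for the branch levels, the inclusive upper bound, and the assumption of distinct keys are all illustrative choices.

```python
from bisect import bisect_left, bisect_right
from dataclasses import dataclass, field
from typing import Iterator, List, Optional, Tuple


@dataclass
class Leaf:
    keys: List[int] = field(default_factory=list)
    values: List[str] = field(default_factory=list)
    next_leaf: Optional[int] = None  # arena index of the right sibling


class LeafChain:
    """Sorted leaves stored in one list (the arena) and linked left to right."""

    def __init__(self, items: List[Tuple[int, str]], capacity: int = 4) -> None:
        items = sorted(items)  # keys are assumed to be distinct
        self.arena: List[Leaf] = []
        for start in range(0, len(items), capacity):
            chunk = items[start:start + capacity]
            self.arena.append(Leaf([k for k, _ in chunk], [v for _, v in chunk]))
        for i in range(len(self.arena) - 1):
            self.arena[i].next_leaf = i + 1
        # Stand-in for the branch levels: the first key of every leaf.
        self.separators = [leaf.keys[0] for leaf in self.arena]

    def range(self, lo: int, hi: int) -> Iterator[Tuple[int, str]]:
        """Yield (key, value) pairs with lo <= key <= hi, in key order."""
        if not self.arena:
            return
        # Step 1: a single lookup finds the leaf that could hold `lo`.
        idx: Optional[int] = max(bisect_right(self.separators, lo) - 1, 0)
        # Step 2: follow sibling links; no further descents are needed.
        while idx is not None:
            leaf = self.arena[idx]
            for pos in range(bisect_left(leaf.keys, lo), len(leaf.keys)):
                if leaf.keys[pos] > hi:
                    return
                yield leaf.keys[pos], leaf.values[pos]
            idx = leaf.next_leaf


chain = LeafChain([(k, f"v{k}") for k in range(10)], capacity=4)
print([key for key, _ in chain.range(3, 8)])
```

Storing sibling links as arena indices rather than object references is what lets all nodes live in one contiguous container; a real tree would replace the `separators` list with branch nodes.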

🛠️ Development

Rust Development

cd rust/
cargo test --features testing
cargo bench

Python Development

cd python/
pip install -e .
python -m pytest tests/

Cross-Language Benchmarking

python scripts/analyze_benchmarks.py

🤝 Contributing

This project follows Test-Driven Development and Tidy First principles:

Write tests first - All features start with failing tests
Small, focused commits - Separate structural and behavioral changes
Comprehensive validation - Both implementations tested against reference implementations
Performance awareness - All changes benchmarked for performance impact
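Testing against a reference implementation can be sketched as a differential test: apply the same random operations to the structure under test and to a trusted reference, and compare after every step. In the Python sketch below, `SortedListMap` is a hypothetical stand-in for the structure under test and a built-in `dict` is the reference; the project's actual test suites live under rust/ and python/ and may be organised differently.

```python
import random
from bisect import bisect_left
from typing import List, Tuple


class SortedListMap:
    """Hypothetical stand-in for the structure under test."""

    def __init__(self) -> None:
        self._keys: List[int] = []
        self._values: List[int] = []

    def __setitem__(self, key: int, value: int) -> None:
        i = bisect_left(self._keys, key)
        if i < len(self._keys) and self._keys[i] == key:
            self._values[i] = value          # overwrite existing key
        else:
            self._keys.insert(i, key)        # keep keys sorted
            self._values.insert(i, value)

    def __delitem__(self, key: int) -> None:
        i = bisect_left(self._keys, key)
        if i == len(self._keys) or self._keys[i] != key:
            raise KeyError(key)
        del self._keys[i]
        del self._values[i]

    def items(self) -> List[Tuple[int, int]]:
        return list(zip(self._keys, self._values))


def differential_test(seed: int, steps: int = 1000) -> int:
    """Run random inserts and deletes against both maps; return the final size."""
    rng = random.Random(seed)  # seeded so any failure can be replayed
    candidate, reference = SortedListMap(), {}
    for _ in range(steps):
        key = rng.randrange(50)
        if rng.random() < 0.7:
            candidate[key] = reference[key] = rng.randrange(1000)
        elif key in reference:
            del candidate[key]
            del reference[key]
        # Both maps must agree on contents and ordering after every step.
        assert candidate.items() == sorted(reference.items())
    return len(reference)


print(differential_test(seed=0))
```

A small key space (50 keys here) forces frequent overwrites and deletes of existing keys, which is where rebalancing bugs tend to surface.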

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🔗 Links

GitHub Repository
Rust Crate (coming soon)
Python Package (coming soon)

Built with ❤️ following Kent Beck's Test-Driven Development methodology.

KentBeck/BPlusTree3
