12 unstable releases (3 breaking)
| 0.4.0 | Nov 26, 2025 |
|---|---|
| 0.3.2 | Aug 29, 2025 |
| 0.3.1 | Jul 25, 2025 |
| 0.2.4 | Jul 9, 2025 |
| 0.1.3 | Jul 1, 2025 |
#257 in Machine learning
2MB
22K
SLoC
Veloxx: Ultra-High Performance Data Processing & Analytics Library
๐ v0.4.0 Released! Major performance overhaul with SIMD acceleration, Pivot, and Outer Join support.
Veloxx is a blazing-fast, ultra-lightweight data processing and analytics library in Rust, with seamless bindings for Python and WebAssembly. Built from the ground up for maximum performance, featuring advanced SIMD acceleration, memory optimization, and parallel processing that often outperforms industry leaders.
๐ Performance Highlights
- SIMD Acceleration: Vectorized aggregation (
sum,mean,min,max) now 30-90x faster than scalar implementations. - Parallel Processing: Hybrid execution strategy using Rayon for large datasets (>500k rows), achieving near-linear scaling.
- Optimized I/O: Multi-threaded memory-mapped CSV reading and zero-copy Parquet integration.
- Lazy Evaluation: Refined Query Optimizer with predicate pushdown for efficient filtering.
โจ New Features (v0.4.0)
- Pivot: Reshape DataFrames from long to wide format with aggregation.
- Outer Join: Full support for
Left,Right,Inner, andOuterjoins. - Deterministic Columns: Refactored internal storage to guarantee consistent column ordering.
- Python Bindings: Updated
PyDataFramewithpivotandouter_joinsupport.
โจ Project Links
- ๐ฆ Rust crate (crates.io)
- ๐ Python package (PyPI)
- ๐ฆ JavaScript package (npm)
- ๐ GitHub
- ๐ Online Documentation
๐งฉ Core Principles & Design Goals
- ๐ Performance First: Advanced SIMD, parallel processing, cache-optimized algorithms
- ๐ชถ Lightweight: Minimal dependencies, optimized memory footprint
- ๐ฆบ Safety & Reliability: Memory-safe Rust, comprehensive testing
- ๐งโ๐ป Developer Experience: Intuitive APIs, excellent documentation
- ๐ง Production Ready: Zero-warning compilation, extensive benchmarking
๐ฉ Key Features
Core Data Structures
- DataFrame and Series for lightning-fast tabular data processing
- SIMD-optimized operations with AVX2/NEON acceleration
- Memory-efficient storage with advanced compression
High-Performance Operations
- ๐ Ultra-fast analytics: filtering, joining, grouping, aggregation, pivoting
- ๐ Advanced statistics: correlation, regression, time-series analysis
- Parallel processing: Multi-threaded execution with work-stealing
- ๐งฎ Vectorized math: SIMD-accelerated arithmetic operations
Advanced I/O & Integration
- ๐ Multiple formats: CSV, JSON, Parquet support
- ๐ Database connectivity: SQLite, PostgreSQL, MySQL
- ๐ Streaming operations: Memory-efficient large dataset processing
- โก Async I/O: Non-blocking file and network operations
Data Quality & ML
- ๐งน Data cleaning: Automated outlier detection, validation
- ๐ค Machine learning: Linear/logistic regression, clustering, preprocessing
- ๐ Visualization: Charts, plots, statistical graphics
- ๐ Data profiling: Schema inference, quality metrics
Multi-Language Support
- ๐ฆ Rust: Native, zero-cost abstractions
- Python: PyO3 bindings with NumPy integration
- ๐ WebAssembly: Browser and Node.js support
- ๐ฆ Easy installation: Available on crates.io, PyPI, npm
โก Quick Start
Rust
[dependencies]
veloxx = "0.4.0"
use veloxx::dataframe::DataFrame;
use veloxx::series::Series;
let df = DataFrame::new_from_csv("data.csv")?;
let filtered = df.filter(&your_condition)?;
let grouped = df.group_by(vec!["category"]).agg(vec![("amount", "sum")])?;
Python
import veloxx
df = veloxx.PyDataFrame({"name": veloxx.PySeries("name", ["Alice", "Bob"])})
filtered = df.filter(...)
pivoted = df.pivot(values="score", index=["name"], columns="subject", agg_fn="mean")
JavaScript/Wasm
const veloxx = require("veloxx");
const df = new veloxx.WasmDataFrame({name: ["Alice", "Bob"]});
const filtered = df.filter(...);
๐ ๏ธ Feature Flags
Enable only what you need:
advanced_ioโ Parquet, databases, asyncdata_qualityโ Schema checks, anomaly detectionwindow_functionsโ Window analyticsvisualizationโ Chartingmlโ Machine learningpythonโ Python bindingswasmโ WebAssembly
๐ Documentation
๐งโ๐ป Examples
Run ready-made examples:
cargo run --example basic_dataframe_operations
cargo run --example advanced_io --features advanced_io
# ... more in the examples/ folder
๐ค Contributing
See CONTRIBUTING.md for guidelines. Please review our Code of Conduct.
๐ฌ Support
- Issues: https://github.com/Conqxeror/veloxx/issues
- Discussions: https://github.com/Conqxeror/veloxx/discussions
- Documentation: https://conqxeror.github.io/veloxx/
๐ License
MIT License. See LICENSE.
Dependencies
~8โ34MB
~419K SLoC