-
GPU-Accelerating Apache Spark @NVIDIA
- United States
Stars
A curated list of practical Codex skills for automating workflows across the Codex CLI and API.
Complete Claude Code configuration collection - agents, skills, hooks, commands, rules, MCPs. Battle-tested configs from an Anthropic hackathon winner.
Simple, portable, and self-contained stacktrace library for C++11 and newer
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
stdgpu: Efficient STL-like Data Structures on the GPU
Slint is an open-source declarative GUI toolkit to build native user interfaces for Rust, C++, JavaScript, or Python apps.
Create a Movie animation plus Audio plus Subtitle from a text file
Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.
Notes talking about the design and implementation of Apache Spark
Examples of single-cell genomic analysis accelerated with RAPIDS
The fastest logging library in the world. Built from scratch in Scala and programmatically configurable.
Scala library for boilerplate-free, type-safe data transformations
Extremely fast Query Engine for DataFrames, written in Rust
Spark RAPIDS Container β Docker containers for Spark RAPIDS
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
A creator library for procedural 2D noises and patterns in Rust.
A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.
NVIDIA Federated Learning Application Runtime Environment
Notes from books and other interesting things that I've read. Table of contents at the end π
All Algorithms implemented in Python
Spark RAPIDS Benchmarks β benchmark sets and utilities for the RAPIDS Accelerator for Apache Spark
Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask without any rewrites.
Data science interview questions and answers