Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View Deegue's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Deegue

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

cuDF - GPU DataFrame Library

C++ 9,498 1,012 Updated Feb 27, 2026

Apache DataFusion SQL Query Engine

Rust 8,452 1,971 Updated Feb 27, 2026

Empowering everyone to build reliable and efficient software.

Rust 110,755 14,539 Updated Feb 27, 2026

Axiom is a set of reusable and extensible components designed to be compatible with Velox. Its primary purpose is to simplify the process of building front-ends for query execution powered by Velox.

C++ 60 64 Updated Feb 26, 2026
C++ 118 54 Updated Feb 27, 2026

An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.

Rust 2,741 135 Updated Feb 27, 2026

The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing

Rust 1,716 209 Updated Feb 27, 2026

Real-time analytics on Postgres tables

Rust 1,933 62 Updated Dec 3, 2025

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 18,163 2,562 Updated Feb 26, 2026

Pretrain, finetune and serve LLMs on Intel platforms with Ray

Python 130 36 Updated Sep 23, 2025

RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.

Python 368 79 Updated Feb 1, 2026

Power CLI and Workflow manager for LLMs (core package)

Python 3,720 470 Updated Dec 29, 2025

LLM inference in C/C++

C++ 96,045 15,101 Updated Feb 27, 2026

Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.

Rust 9,164 852 Updated Feb 27, 2026

ClickBench: a Benchmark For Analytical Databases

HTML 964 252 Updated Feb 27, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 41,511 7,267 Updated Feb 27, 2026

A memory profiler for Linux.

C 4,761 200 Updated Jul 28, 2023

Graphs for Everyone

Java 15,964 2,569 Updated Feb 10, 2026

LingoDB: A new analytical database system that blurs the lines between databases and compilers.

C++ 297 55 Updated Feb 27, 2026

A modular acceleration toolkit for big data analytic engines

C++ 67 24 Updated May 6, 2024

Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.

Scala 257 73 Updated Feb 21, 2023

The Fastest Distributed Database for Transactional, Analytical, and AI Workloads.

C++ 9,990 1,873 Updated Feb 27, 2026

JDK main-line development https://openjdk.org/projects/jdk

Java 22,651 6,265 Updated Feb 27, 2026

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 13,272 1,166 Updated Feb 27, 2026

hera 分布式任务调度系统 大数据任务调度系统 任务调度 (数据部门专用)

Java 373 97 Updated Aug 14, 2023

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,697 1,402 Updated Jan 28, 2026

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

Python 3,106 1,473 Updated Feb 23, 2026

BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

Jupyter Notebook 2,692 732 Updated Feb 26, 2026

Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive

Scala 187 33 Updated Oct 15, 2025

Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.

Scala 1,845 546 Updated May 29, 2024
Next