Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View xxhdx1985126's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report xxhdx1985126

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Multimodal-Composite-Editing-and-Retrieval-update

35 1 Updated Oct 13, 2025

Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)

57 5 Updated Dec 17, 2025

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Go 2,875 469 Updated Jan 21, 2026

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…

3,588 360 Updated Jul 25, 2025

On the Theoretical Limitations of Embedding-Based Retrieval

Jupyter Notebook 619 47 Updated Sep 15, 2025

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,639 2,829 Updated Jan 8, 2026

Storage Orchestration for Kubernetes

Go 13,336 2,805 Updated Jan 20, 2026

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,342 2,707 Updated Aug 12, 2024

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Python 855 52 Updated Jul 29, 2024

Code release for VTW (AAAI 2025 Oral)

Python 64 1 Updated Nov 4, 2025

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 93,268 6,832 Updated Jan 20, 2026

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 91,880 10,702 Updated Jan 21, 2026

High performance distributed cache system. Built by Rust.

Rust 1 Updated Jan 21, 2026

magic-trace collects and displays high-resolution traces of what a process is doing

OCaml 5,205 119 Updated Jan 14, 2026

This is the user space repo for famfs, the fabric-attached memory file system

C 90 3 Updated Jan 15, 2026

The universal proxy platform

Go 29,853 3,523 Updated Jan 17, 2026

Rule Snippet & Rule Set for Surge / Mihomo (Clash.Meta) / Clash Premium (Dreamacro) / sing-box / Surfboard for Android / Stash

TypeScript 3,618 261 Updated Jan 20, 2026

daed, A modern web dashboard for dae.

TypeScript 1,071 107 Updated Dec 15, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,318 4,688 Updated Jan 20, 2026

Ongoing research training transformer models at scale

Python 14,975 3,511 Updated Jan 21, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,978 12,733 Updated Jan 21, 2026

[EMNLP 2025] Circuit-Aware Editing Enables Generalizable Knowledge Learners

Python 17 3 Updated Nov 17, 2025

FireFlyer Record file format, writer and reader for DL training samples.

Python 238 24 Updated Dec 1, 2022

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,646 997 Updated Jan 20, 2026

Expert Parallelism Load Balancer

Python 1,334 198 Updated Mar 24, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,909 313 Updated Jan 14, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,109 803 Updated Jan 16, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 8,903 1,067 Updated Jan 20, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,022 940 Updated Jan 20, 2026

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,512 200 Updated May 7, 2025
Next