Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View solrex's full-sized avatar

Organizations

@JabRef

Block or report solrex

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

CUGA is an open-source generalist agent for the enterprise, supporting complex task execution on web and APIs, OpenAPI/MCP integrations, composable architecture, reasoning modes, and policy-aware f…

Python 531 73 Updated Dec 18, 2025

A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node

C 60 5 Updated Dec 19, 2025

WonderTrader——量化研发交易一站式框架

C++ 5,718 1,091 Updated Sep 30, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 2,003 161 Updated Dec 20, 2025

A modern replacement for Redis and Memcached

C++ 29,557 1,123 Updated Dec 22, 2025

Trainable fast and memory-efficient sparse attention

Python 490 46 Updated Dec 23, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,672 751 Updated Dec 23, 2025

A tool to configure, launch and manage your machine learning experiments.

Python 211 87 Updated Dec 23, 2025

Scalable toolkit for efficient model reinforcement

Python 1,163 201 Updated Dec 23, 2025

Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 213 36 Updated Dec 20, 2025

HuggingFace conversion and training library for Megatron-based models

Python 301 109 Updated Dec 22, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,345 3,243 Updated Dec 22, 2025

Toward Universal Multimodal Embedding

Python 72 4 Updated Aug 1, 2025

NVIDIA GPUDirect Storage Driver

C 311 52 Updated Dec 18, 2025

Delivers efficient, stable, and secure data distribution and acceleration powered by P2P technology, with an optional content‑addressable filesystem that accelerates OCI container launch.

Go 2,942 359 Updated Dec 22, 2025

Qwen-Image-Lightning: Speed up Qwen-Image model with distillation

Python 1,058 41 Updated Dec 22, 2025

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,485 216 Updated Dec 15, 2025

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

Go 9,947 1,070 Updated Dec 22, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,690 309 Updated Nov 13, 2025

[ICML 2025] Official PyTorch implementation of "FlatQuant: Flatness Matters for LLM Quantization"

Python 202 22 Updated Nov 25, 2025

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,891 291 Updated Dec 22, 2025

A web-based 3D CAD application for online model design and editing

TypeScript 4,021 354 Updated Dec 22, 2025

All in one project management tool for efficient teams

TypeScript 2,884 278 Updated Nov 20, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,947 357 Updated Dec 23, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,298 296 Updated May 11, 2025

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLM, VLM, and video generation models.

Python 643 64 Updated Nov 19, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,511 182 Updated Dec 23, 2025

A Python program that uses tkinter as a UI. It helps organize photos by putting them in folders based on the time they were taken.

Python 2 1 Updated Feb 13, 2021

CUDA Templates for Linear Algebra Subroutines

C++ 1 Updated Jul 8, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,914 3,837 Updated Dec 23, 2025
Next