Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View Guan-JW's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report Guan-JW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This project aims at building a scalable transactional stream processing engine on modern hardware. It allows ACID transactions to be run directly on streaming data. It shares similar project visio…

C 140 7 Updated May 4, 2025
C++ 15 3 Updated Nov 11, 2025
Python 27 3 Updated Mar 24, 2025

lzbench is an in-memory benchmark of open-source compressors

C 1,040 203 Updated Jan 7, 2026

Zstandard - Fast real-time compression algorithm

C 26,425 2,380 Updated Dec 22, 2025

New generation entropy codecs : Finite State Entropy and Huff0

C 1,460 157 Updated Mar 21, 2024

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Python 370 30 Updated Sep 25, 2024

LongBench v2 and LongBench (ACL 25'&24')

Python 1,070 116 Updated Jan 15, 2025
Python 13 Updated Nov 6, 2021

A large collection of system log datasets for AI-driven log analytics [ISSRE'23]

2,494 730 Updated Jan 18, 2026

Some quick and dirty Postgres benchmarks

Python 14 1 Updated Feb 27, 2022

Silesia compression corpus

23 5 Updated Sep 2, 2018

Awesome LLM compression research papers and tools.

1,757 116 Updated Nov 10, 2025

Perf monitoring CLI tool for Apple Silicon

Python 4,411 184 Updated Apr 18, 2024

Examples in the MLX framework

Python 8,144 1,123 Updated Dec 15, 2025

MLX: An array framework for Apple silicon

C++ 23,499 1,458 Updated Jan 18, 2026

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,151 2,686 Updated Nov 3, 2025

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 12,059 938 Updated Mar 11, 2025

The definitive Web UI for local AI, with powerful features and easy setup.

Python 45,873 5,883 Updated Jan 15, 2026

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 3,675 304 Updated May 21, 2025

FlashInfer: Kernel Library for LLM Serving

Python 4,692 655 Updated Jan 18, 2026

A time-series database for high-performance real-time analytics packaged as a Postgres extension

C 21,440 1,032 Updated Jan 18, 2026

SpotServe: Serving Generative Large Language Models on Preemptible Instances

134 14 Updated Feb 22, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,275 758 Updated Jan 10, 2026

An easy to use PyTorch to TensorRT converter

Python 4,843 696 Updated Aug 17, 2024

A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.

410 26 Updated Sep 26, 2024

Apache HBase

Java 5,570 3,384 Updated Jan 17, 2026

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,447 1,262 Updated Jan 14, 2026
Next