Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Earthring's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Earthring

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 35,902 4,235 Updated Dec 14, 2025

Code release for "Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning" (CoRL 2025), https://arxiv.org/abs/2505.04317

Python 19 Updated Nov 19, 2025

GameStream client for PCs (Windows, Mac, Linux, and Steam Link)

C++ 15,503 923 Updated Dec 21, 2025

[NeurIPS 2025 Spotlight] EDELINE: Enhancing Memory in Diffusion-based World Models via Linear-Time Sequence Modeling

Python 12 Updated Oct 18, 2025

Code release for "A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play" (NeurIPS 2025), https://arxiv.org/abs/2502.01932

Python 54 5 Updated Nov 10, 2025

Unified Reinforcement Learning Framework

Python 799 79 Updated Sep 6, 2024

一款提示词优化器,助力于编写高质量的提示词

TypeScript 17,958 2,233 Updated Dec 20, 2025

Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.

C++ 3,557 465 Updated Dec 11, 2025
Jupyter Notebook 16 Updated Sep 1, 2025

GameStream client for Android

C 6,075 986 Updated Dec 17, 2025

Self-hosted game stream host for Moonlight.

C++ 32,556 1,582 Updated Dec 21, 2025

Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934

Python 170 7 Updated Oct 28, 2025

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,064 140 Updated Dec 18, 2025

AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目

2,713 241 Updated Oct 30, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,761 1,071 Updated Dec 21, 2025

Code for paper "Is Extending Modality The Right Path Towards Omni-Modality?"

Python 13 Updated Jun 3, 2025

A list of awesome papers and resources of recommender system on large language model (LLM).

2,163 156 Updated Mar 17, 2025

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 2,576 257 Updated Aug 28, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,285 7,791 Updated Dec 21, 2025

Survey: A collection of AWESOME papers and resources on the large language model (LLM) related recommender system topics.

1,471 80 Updated Oct 7, 2025

Official repository for "Trajectory World Models for Heterogeneous Environments" (ICML 2025), https://arxiv.org/abs/2502.01366

Python 15 Updated Oct 6, 2025

Official repository for "CompilerDream: Learning a Compiler World Model for General Code Optimization" (KDD 2025), https://arxiv.org/abs/2404.16077

Python 7 Updated May 30, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 11,414 1,152 Updated Apr 30, 2025

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 2,991 205 Updated Dec 21, 2025

About model release for "Sundial: A Family of Highly Capable Time Series Foundation Models" (ICML 2025 Oral)

165 11 Updated Sep 12, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,613 227 Updated Jun 17, 2025

Implementation of all RL algorithms in a simpler way

Jupyter Notebook 2 Updated Apr 9, 2025

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 1,024 101 Updated Dec 30, 2024

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 376 44 Updated Oct 29, 2025

Official implementation for "TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables" (NeurIPS 2024)

Python 425 63 Updated Nov 27, 2024
Next