yanghanxy
  • Xiaohongshu
  • Shanghai, China


Computational advertising / recommender systems / machine learning / click-through rate (CTR) / conversion rate (CVR) prediction / CTR prediction

2,065 442 Updated Dec 17, 2019

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,221 368 Updated Aug 14, 2025

Ongoing research training transformer models at scale

Python 14,932 3,497 Updated Jan 17, 2026

PyTorch Tutorial for Deep Learning Researchers

Python 32,102 8,281 Updated Aug 15, 2023

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,417 287 Updated Jul 17, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,116 1,091 Updated Nov 18, 2024

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 37,111 6,132 Updated Nov 10, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,170 395 Updated Jul 11, 2024

Facebook's Hive UDFs

Java 277 150 Updated Dec 15, 2025

Havenask is a large-scale distributed information search system widely used within Alibaba Group

C++ 1,796 336 Updated Nov 3, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,281 4,683 Updated Jan 17, 2026

Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"

HTML 913 72 Updated Nov 25, 2023

Inference code for Llama models

Python 59,067 9,809 Updated Jan 26, 2025

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

Python 18,975 1,872 Updated Jul 15, 2025

ModelScope: bring the notion of Model-as-a-Service to life.

Python 8,646 902 Updated Jan 16, 2026

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

10,136 781 Updated May 31, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,736 483 Updated Jan 8, 2024

Source code for Twitter's Recommendation Algorithm

Python 10,462 2,235 Updated Jul 10, 2024

Source code for the X Recommendation Algorithm

Scala 70,404 12,948 Updated Sep 8, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,312 987 Updated Jan 16, 2026

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Python 1,778 272 Updated Mar 28, 2024

Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.

C++ 665 190 Updated Jan 13, 2026

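A collection of industry practice articles on search, recommendation, advertising, user growth, and related topics (sources: Zhihu, Datafuntalk, technical WeChat official accounts)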

HTML 4,181 464 Updated Jan 17, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 52,072 8,747 Updated Nov 12, 2025

Your self-hosted, globally interconnected microblogging community

Ruby 49,507 7,388 Updated Jan 17, 2026

AutoPhrase: Automated Phrase Mining from Massive Text Corpora

C++ 1,200 275 Updated Jan 27, 2022

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,444 1,262 Updated Jan 14, 2026

Multi-thread implementation of Factorization Machines with FTRL for multi-class classification problem which uses softmax as hypothesis.

C++ 71 32 Updated Jun 22, 2021

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 11,259 878 Updated Jan 13, 2026

Diffusion-LM

Python 1,216 160 Updated Aug 8, 2024