Stars
计算广告/推荐系统/机器学习(Machine Learning)/点击率(CTR)/转化率(CVR)预估/点击率预估
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LMOngoing research training transformer language models at scale, including: BERT & GPT-2
Ongoing research training transformer models at scale
PyTorch Tutorial for Deep Learning Researchers
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
LAVIS - A One-stop Library for Language-Vision Intelligence
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Havenask is a large-scale distributed information search system widely used within Alibaba Group
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
ModelScope: bring the notion of Model-as-a-Service to life.
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Source code for Twitter's Recommendation Algorithm
Source code for the X Recommendation Algorithm
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Your self-hosted, globally interconnected microblogging community
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Multi-thread implementation of Factorization Machines with FTRL for multi-class classification problem which uses softmax as hypothesis.
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.