- jd_ad
- Beichen, Beijing
Stars
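[PyTorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"
AIInfra (AI infrastructure) refers to the AI system stack, from underlying hardware such as chips up to the upper-level software stack, that supports training and inference of large AI models.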
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Ongoing research training transformer models at scale
🚀🚀 "Large models": train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏
"A Guide to Building White-Box Large Models": a fully hand-built Tiny-Universe
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
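This project aims to share the technical principles behind large models and hands-on experience with them (large-model engineering and real-world application deployment).
"Dive into LLMs": a series of hands-on programming tutorials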
Offers a toolset for comprehensive, multi-faceted large-scale data analysis and optimizations
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
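ChatGLM2-6B: An Open Bilingual Chat LLM
Luotuo (骆驼): Open-Source Chinese Language Models. Developed by 陈启源 @ Central China Normal University, 李鲁鲁 @ SenseTime, and 冷子昂 @ SenseTime
MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) text. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing stories, chat logs, and more.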
Source code for Twitter's Recommendation Algorithm
Source code for the X Recommendation Algorithm
Making large AI models cheaper, faster and more accessible
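A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to make it do what you ask.
AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.
A good project for campus recruitment, autumn and spring hiring seasons, and internships! Implement a high-performance deep learning inference library from scratch, step by step, with support for inference of models such as llama2, Unet, Yolov5, and Resnet.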
Distributed model training (LR, FM) demo using ps-lite, with FTRL and SGD optimization algorithms.
word2vec++ is a Distributed Representations of Words (word2vec) library and tools implementation, written in C++11 from scratch