Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View wangzhegeek's full-sized avatar

Organizations

@ARASC

Block or report wangzhegeek

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Python 706 101 Updated Sep 22, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 5,882 814 Updated Dec 22, 2025

An MCP-based chatbot | 一个基于MCP的聊天机器人

C++ 23,587 4,986 Updated Jan 27, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,434 4,699 Updated Jan 28, 2026

Ongoing research training transformer models at scale

Python 15,038 3,539 Updated Jan 28, 2026

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 38,165 4,572 Updated Jan 18, 2026

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Jupyter Notebook 4,413 436 Updated Dec 2, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,495 938 Updated Jan 18, 2026

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 22,973 2,668 Updated Dec 30, 2025

《动手学大模型Dive into LLMs》系列编程实践教程

Jupyter Notebook 17,200 1,990 Updated Oct 10, 2025

Offers a toolset for comprehensive, multi-faceted large-scale data analysis and optimizations

Python 72 18 Updated Oct 22, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 155,832 31,876 Updated Jan 27, 2026

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,679 1,833 Updated Jun 27, 2024

骆驼(Luotuo): Open Sourced Chinese Language Models. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技

Jupyter Notebook 3,620 247 Updated Sep 3, 2023

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

4,106 287 Updated Jan 3, 2026

Source code for Twitter's Recommendation Algorithm

Python 10,515 2,247 Updated Jul 10, 2024

Source code for the X Recommendation Algorithm

Scala 72,305 13,200 Updated Sep 8, 2025

Making large AI models cheaper, faster and more accessible

Python 41,331 4,539 Updated Jan 19, 2026

The AI Code Editor

32,144 2,192 Updated Nov 19, 2025

ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。

58,086 13,593 Updated Jan 1, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,193 2,321 Updated Sep 3, 2025

校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

C++ 3,301 358 Updated Jun 22, 2025

Caffe: a fast open framework for deep learning.

C++ 34,831 18,574 Updated Jul 31, 2024

Distributed training model(LR, FM) demo using ps-lite. FTRL and SGD Optimization Algorithm.

C++ 9 5 Updated Feb 28, 2020

每周五发布,精选优质开发者内容,包括开源项目、工具资源、技术文章等方面。

1,916 145 Updated Dec 23, 2023

word2vec++ is a Distributed Representations of Words (word2vec) library and tools implementation, written in C++11 from the scratch

C++ 139 24 Updated Oct 14, 2023

DNN framework based on ps-lite

C++ 30 6 Updated Feb 20, 2021

该仓库主要记录 推荐系统 算法工程师相关的面试题

589 92 Updated Sep 23, 2023
Next