-
CAD&CG State Key Laboratory
- Zhejiang Univ, Hangzhou, Zhejiang, China
Stars
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
Scalable toolkit for efficient model alignment
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
A bibliography and survey of the papers surrounding o1
Train transformer language models with reinforcement learning.
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Distributed platform for building autonomic network functions.
✨✨Latest Advances on Multimodal Large Language Models
[EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference"
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
A curated list for Efficient Large Language Models
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
A curated list of reinforcement learning with human feedback resources (continually updated)
🆓免费的 ChatGPT 镜像网站列表,持续更新。List of free ChatGPT mirror sites, continuously updated.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
A toolbox of vision models and algorithms based on MindSpore
MindSpore large-scale recommender system library.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Language-Agnostic SEntence Representations