-
Carnegie Mellon University
- Pittsburgh, PA
Lists (2)
Sort Name ascending (A-Z)
Stars
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
A simple, performant and scalable Jax LLM!
A framework for few-shot evaluation of language models.
A repository of links with advice related to grad school applications, research, phd etc
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
Transform a multicolor image into embroidery patterns that you can make on a sewing machine!
Retrieval-Augmented Generation (RAG) on 17M full text journal articles.
A fast, clean, responsive Hugo theme.
A guidance language for controlling large language models.
A curated list of recent diffusion models for video generation, editing, and various other applications.
📱 Collaborative List of Open-Source iOS Apps
A python library to compute the graph Ricci curvature and Ricci flow on NetworkX graph.
Analyzing CS graduate admission records in the US (2009-2016) to predict admission decisions.
A collection of robotics simulation environments for reinforcement learning
Load YouTube videos with the HTLML5 <video> element without needing iframes or the YouTube JS API.
ROS Navigation stack. Code for finding where the robot is and how it can get somewhere else.
Differential Programming/Differentiable Programming
CohenQU / husarnet
Forked from husarnet/husarnetHusarnet is a Peer-to-Peer VPN to connect your laptops, servers and microcontrollers over the Internet with zero configuration.