- Meta | UC Berkeley
- California
- rishabhtiwari.org
- @tiwarishabh16
- in/rishabh-tiwari16
tinker-cookbook-custom Public
Forked from devvrit/tinker-cookbook-custom
Post-training with Tinker
Python Apache License 2.0 Updated Dec 19, 2025
LiveCodeBench Public
Forked from LiveCodeBench/LiveCodeBench
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
Python MIT License Updated Dec 17, 2025
LLaMA-Factory Public
Forked from hiyouga/LlamaFactory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Python Apache License 2.0 Updated Dec 17, 2025
open-thoughts Public
Forked from open-thoughts/open-thoughts
Fully open data curation for reasoning models
Python Apache License 2.0 Updated Sep 8, 2025
transformers Public
Forked from huggingface/transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Python Apache License 2.0 Updated Aug 8, 2025
FLAME-MoE Public
Forked from cmu-flame/FLAME-MoE
Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models
Jupyter Notebook Updated May 28, 2025
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Apr 18, 2025
RSD Public
Forked from BaohaoLiao/RSD
Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.
Python Apache License 2.0 Updated Feb 18, 2025
GSM8K-RLVR Public
Forked from Mohammadjafari80/GSM8K-RLVR
A simplified implementation for experimenting with RLVR on GSM8K. This repository provides a starting point for exploring reasoning.
Python Updated Feb 6, 2025
marlin Public
Forked from IST-DASLab/marlin
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.
Python Apache License 2.0 Updated Jan 26, 2025
MagicDec Public
Forked from Infini-AI-Lab/MagicDec
Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding
llm-awq Public
Forked from mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Python MIT License Updated Nov 11, 2024
cords Public
Forked from decile-team/cords
Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coresets and data selection.
Jupyter Notebook MIT License Updated Feb 8, 2023
cs231n-2019-assignments Public
Solution to CS231n Assignments 2019
pytorch-segmentation Public
Forked from yassouali/pytorch-segmentation
🎨 Semantic segmentation models, datasets and losses implemented in PyTorch.
Python MIT License Updated Aug 13, 2021
InvoiceNet Public
Forked from naiveHobo/InvoiceNet
Deep neural network to extract intelligent information from invoice documents.
Python MIT License Updated Jan 19, 2021
Gradient_Starvation Public
Forked from mpezeshki/Gradient_Starvation
Gradient Starvation: A Learning Proclivity in Neural Networks
Python MIT License Updated Jan 10, 2021
NASDAQ_stock_analysis Public
Forked from morpheu513/NASDAQ_stock_analysis
Project Done for the course on Data Analytics UE18CS312
Jupyter Notebook Updated Dec 1, 2020
My-Coding-Solutions Public
This repo contains solutions to coding problems I solved on different platforms.
network-slimming Public
Forked from Eric-mingjie/network-slimming
Network Slimming (Pytorch) (ICCV 2017)
Python MIT License Updated Sep 12, 2020
KAGGLE_DAYS_WEBINAR Public
Kaggle Days webinar on Steel Defect Detection Challenge