Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View bbuing9's full-sized avatar

Block or report bbuing9

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[EMNLP 25] Personalized Language Models via Privacy-Preserving Evolutionary Model Merging

Python 1 Updated Nov 4, 2025

Official implementation of Wan et al's paper "Everyone's Voice Matters: Quantifying Annotation Disagreement Using Demographic Information" (AAAI 2023)

Jupyter Notebook 6 1 Updated Jan 17, 2023

Official implementation of Tabular Transfer Learning via Prompting LLMs (COLM 2024).

Jupyter Notebook 12 Updated Aug 6, 2024

Official implementation of Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning (NeurIPS 2024).

Python 26 2 Updated Mar 4, 2025
Python 2 Updated Oct 15, 2025
Python 7 Updated Sep 17, 2025
Python 2 Updated Nov 2, 2025
Jupyter Notebook 3 Updated Sep 29, 2025
Python 17 Updated Jul 1, 2025

Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors

Python 247 14 Updated Feb 17, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,341 809 Updated Oct 31, 2025

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,403 64 Updated Mar 16, 2025
Python 4,161 445 Updated Jul 31, 2025
Python 22 2 Updated Jan 17, 2025

Lightweight Adapting for Black-Box Large Language Models

Python 24 5 Updated Feb 15, 2024

Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).

Python 42 4 Updated Aug 6, 2024

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 760 66 Updated Apr 7, 2023

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 2,234 210 Updated May 25, 2024

Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)

Python 70 5 Updated Aug 3, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,067 122 Updated Jun 1, 2023

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,755 143 Updated Aug 4, 2024

Official Code for the paper "SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs" (ICLR 2024)

Python 26 1 Updated May 7, 2024

Parkar and Kim et al.'s paper on :SelectLLM: Can LLMs Select Important Instructions to Annotate?"

Python 12 1 Updated Jul 4, 2024

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,896 287 Updated Aug 9, 2025

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

Python 402 19 Updated May 17, 2024

Codes for papers on Large Language Models Personalization (LaMP)

Python 175 10 Updated Feb 18, 2025

Code for the paper "RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training" (EMNLP 2023)

Python 8 Updated Jan 13, 2024
Jupyter Notebook 116 17 Updated May 2, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,774 230 Updated Aug 11, 2024
Next