
View feyzaakyurek's full-sized avatar

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,736 483 Updated Jan 8, 2024

Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments

Jupyter Notebook 60 3 Updated Aug 19, 2024
Jupyter Notebook 53 4 Updated May 20, 2024

Code and Data for "Language Modeling with Editable External Knowledge"

Python 36 6 Updated Jun 19, 2024

Secrets of RLHF in Large Language Models Part I: PPO

Python 1,413 105 Updated Mar 3, 2024

Ongoing research training transformer models at scale

Python 14,933 3,497 Updated Jan 17, 2026

Mass-editing thousands of facts into a transformer memory (ICLR 2023)

Python 535 71 Updated Jan 31, 2024

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)

Python 16 1 Updated Jan 18, 2024

[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors

Python 82 12 Updated Dec 21, 2024

Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).

Python 26 1 Updated Aug 25, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 14,926 12,721 Updated Jan 16, 2026

A library for efficient similarity search and clustering of dense vectors.

C++ 38,774 4,189 Updated Jan 16, 2026

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 18,148 2,686 Updated Nov 3, 2025

[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions

Jupyter Notebook 118 12 Updated Sep 12, 2024

Code for fine-tuning Platypus fam LLMs using LoRA

Python 630 58 Updated Feb 4, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,830 232 Updated Aug 11, 2024
Python 11 2 Updated Apr 23, 2023

Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model

Python 71 6 Updated Nov 1, 2022

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,271 4,025 Updated Jul 17, 2024

Extract addresses and intents from tweet texts

Python 38 5 Updated Feb 17, 2023

A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/

Jupyter Notebook 1,012 82 Updated Dec 16, 2024

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 12,537 2,042 Updated Dec 18, 2025

Google Research

Jupyter Notebook 46 9 Updated Oct 29, 2022

A modular RL library to fine-tune language models to human preferences

Python 2,374 202 Updated Mar 1, 2024

A Collection of BM25 Algorithms in Python

Python 1,293 101 Updated Oct 8, 2024

Automatic metrics for GEM tasks

Python 67 20 Updated Oct 25, 2022
Python 671 87 Updated Nov 1, 2024
Python 2,937 336 Updated Jan 15, 2026

The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.

HTML 12 Updated Dec 15, 2021

Code for the paper "Simulating Bandit Learning from User Feedback for Extractive Question Answering".

Python 19 1 Updated Aug 30, 2022