Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight

Jupyter Notebook 52 1 Updated Feb 12, 2025

Rare-to-Frequent (R2F), ICLR'25, Spotlight

Python 51 Updated Apr 23, 2025

A Survey on Large Language Model-Based Game Agents

736 26 Updated Sep 26, 2025

A cryptocurrency trading API with more than 100 exchanges in JavaScript / TypeScript / Python / C# / PHP / Go

Python 39,487 8,309 Updated Oct 25, 2025

Data used for ACL 2020 paper “None of the Above”: Measure Uncertainty in Dialog Response Retrieval

Python 3 Updated Feb 7, 2021

Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models

Python 249 32 Updated Apr 23, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 89,985 11,257 Updated Sep 8, 2025

Python wrapper for the arXiv API

Python 1,373 142 Updated Aug 20, 2025

Let's build better datasets, together!

Jupyter Notebook 262 28 Updated Dec 20, 2024

Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.

Python 397 22 Updated Feb 12, 2024

[EMNLP'23, ACL'24] To speed up LLM inference and improve LLMs' perception of key information, compress the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.

Python 5,518 328 Updated Mar 11, 2025

MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248

Python 57 1 Updated Jun 18, 2024

Grok open release

Python 50,534 8,367 Updated Aug 30, 2024

A lightweight tool for macOS that smooths mouse scrolling and lets you set the scroll direction independently, making your scroll wheel feel as smooth as a trackpad

Swift 17,718 568 Updated Oct 25, 2025

(Obsolete) Archive of Rant 3.x.

C# 2,956 106 Updated Aug 26, 2020

Python 167 31 Updated Apr 19, 2023

Deep Learning Zero to All - Pytorch

Jupyter Notebook 1,265 1,387 Updated Nov 22, 2020

The code of ACL 2020 paper "You Impress Me: Dialogue Generation via Mutual Persona Perception"

Python 309 45 Updated Oct 27, 2023

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Python 394 56 Updated Apr 13, 2025

Retrieval and Retrieval-augmented LLMs

Python 10,736 804 Updated Oct 22, 2025

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

Python 401 19 Updated May 17, 2024

This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and questions and answers.

Shell 489 68 Updated Apr 15, 2020

Official inference library for Mistral models

Jupyter Notebook 10,520 979 Updated Mar 20, 2025

Spherical Merge Pytorch/HF format Language Models with minimal feature loss.

Python 139 8 Updated Sep 10, 2023

Tools for merging pretrained large language models.

Python 6,399 623 Updated Sep 17, 2025

Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"

Python 78 8 Updated Apr 12, 2023

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Jupyter Notebook 205 65 Updated Jul 19, 2022

BookNLP, a natural language processing pipeline for books

Python 870 115 Updated Jul 31, 2024

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,320 5,809 Updated Aug 14, 2024

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"

Python 2,248 513 Updated Jan 25, 2019