Thanks to visit codestin.com
Credit goes to github.com

Skip to content
Change the repository type filter

All

    Repositories list

    • StateX

      Public
      The official implementation of the paper "StateX: Enhancing RNN Recall via Post-training State Expansion".
      Python
      0100Updated Oct 24, 2025Oct 24, 2025
    • Official implementation for the paper "KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs"
      Python
      01000Updated Oct 21, 2025Oct 21, 2025
    • AgentRM

      Public
      [ACL 2025 main] AgentRM: Enhancing Agent Generalization with Reward Modeling
      Python
      0400Updated Sep 29, 2025Sep 29, 2025
    • The code of the paper Stuffed Mamba: Oversized States Lead to the Inability to Forget
      Python
      0100Updated Sep 28, 2025Sep 28, 2025
    • BurstEngine is an efficient framework designed to train LLMs on long-sequence data.
      Python
      2700Updated Sep 25, 2025Sep 25, 2025
    • The code for the paper "Cost-Optimal Grouped-Query Attention for Long-Context Modeling"
      Python
      1310Updated Sep 14, 2025Sep 14, 2025
    • Python
      6382610Updated Sep 12, 2025Sep 12, 2025
    • SIR-Bench

      Public
      Python
      0310Updated Sep 12, 2025Sep 12, 2025
    • Seq1F1B

      Public
      Sequence-level 1F1B schedule for LLMs.
      Python
      3.2k3210Updated Aug 26, 2025Aug 26, 2025
    • A LLM-based Agent that predict its tasks proactively.
      Python
      3643250Updated Aug 22, 2025Aug 22, 2025
    • [ACL'25 Main] ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation
      Python
      26420Updated Jul 30, 2025Jul 30, 2025
    • FR-Spec

      Public
      [ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling
      C++
      24430Updated Jul 15, 2025Jul 15, 2025
    • BlockFFN

      Public
      Source codes for paper "BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity".
      Python
      51700Updated Jul 14, 2025Jul 14, 2025
    • TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators
      Python
      98841Updated Jun 14, 2025Jun 14, 2025
    • Python
      0800Updated Jun 11, 2025Jun 11, 2025
    • DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
      Python
      16510Updated Jun 10, 2025Jun 10, 2025
    • Must-read Papers on Textual Adversarial Attack and Defense
      Python
      1951.6k30Updated Jun 4, 2025Jun 4, 2025
    • Python
      0000Updated May 28, 2025May 28, 2025
    • DIET

      Public
      Official code for "The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training"
      Python
      0000Updated May 27, 2025May 27, 2025
    • Migician

      Public
      [ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
      Python
      48000Updated May 20, 2025May 20, 2025
    • ToLeaP

      Public
      Python
      1511Updated May 17, 2025May 17, 2025
    • SICOG

      Public
      Will Pre-Training Ever End? A First Step Toward Next-Generation Foundation MLLMs via Self-Improving Systematic Cognition
      Python
      23110Updated May 14, 2025May 14, 2025
    • LLaVA-UHD

      Public
      LLaVA-UHD v2: an MLLM Integrating High-Resolution Semantic Pyramid via Hierarchical Window Transformer
      Python
      20388100Updated Apr 20, 2025Apr 20, 2025
    • Code for the paper "The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning"
      Python
      0500Updated Apr 8, 2025Apr 8, 2025
    • DeepNote

      Public
      Python
      813010Updated Apr 7, 2025Apr 7, 2025
    • Ouroboros

      Public
      Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)
      Python
      911040Updated Mar 20, 2025Mar 20, 2025
    • Learning to Generate STRUCTURED Output with Schema Reinforcement Learning
      Python
      41800Updated Mar 2, 2025Mar 2, 2025
    • APB

      Public
      Official Implementation of APB (ACL 2025 main Oral)
      C++
      33100Updated Feb 22, 2025Feb 22, 2025
    • Evaluate Multimodal LLMs as Embodied Agents
      Python
      45420Updated Feb 14, 2025Feb 14, 2025
    • LEGENT

      Public
      Open Platform for Embodied Agents
      Python
      2333191Updated Jan 12, 2025Jan 12, 2025