- Beijing
- http://blog.csdn.net/hzh_0000
Stars
Train transformer language models with reinforcement learning.
Official implementation of "SViT: Revisiting Token Pruning for Object Detection and Instance Segmentation"
This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.
A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention
[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the layout fully preserved; supports services such as Google/DeepL/Ollama/OpenAI and provides CLI/GUI/MCP/Docker/Zotero
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Fast inference from large language models via speculative decoding
Ongoing research training transformer models at scale
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
Fast and memory-efficient exact attention
🍺 The missing package manager for macOS (or Linux)
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
A simple, fast and user-friendly alternative to 'find'
ClickHouse® is a real-time analytics database management system
An open protocol enabling communication and interoperability between opaque agentic applications.
A bridge between Streamable HTTP and stdio MCP transports