Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation length and maintaining KV-cache compatibility, achieving high eff…

Python 62 1 Updated Oct 17, 2025

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Python 20,911 1,942 Updated Oct 24, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 7,898 518 Updated Oct 22, 2025

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 16,522 1,153 Updated Oct 4, 2025

The official Python SDK for Model Context Protocol servers and clients

Python 19,556 2,662 Updated Oct 24, 2025

Korean dataset links

896 103 Updated Oct 14, 2024

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 9,750 1,008 Updated Sep 24, 2025

Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024)

Python 312 27 Updated Oct 2, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 60,881 10,744 Updated Oct 24, 2025

Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.

Rust 507 26 Updated Oct 23, 2025

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 2,328 222 Updated Aug 28, 2025

[CVPR 2023] "PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation" official implementation.

Python 341 39 Updated Feb 9, 2025

MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition

Jupyter Notebook 283 52 Updated May 26, 2022

Real-time Action detection demo for the work Actor Conditioned Attention Maps. This repo includes a complete pipeline for person detection/tracking and analyzing their actions in real-time.

Python 152 38 Updated Dec 8, 2022

Demo of a customer service use case implemented with the OpenAI Agents SDK

TypeScript 5,840 894 Updated Aug 25, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 76,076 11,194 Updated Oct 22, 2025

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai+, TMLR2025]

97 3 Updated Jun 16, 2025

[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 2,086 80 Updated Oct 16, 2025

A Python module to repair invalid JSON from LLMs

Python 3,619 146 Updated Oct 22, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,027 367 Updated Oct 21, 2025

Awesome list for LLM quantization

Python 328 20 Updated Oct 11, 2025

[CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models

Python 104 1 Updated May 29, 2025

Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.

Python 10,165 855 Updated Oct 12, 2025

Anthropic's Interactive Prompt Engineering Tutorial

Jupyter Notebook 24,970 2,269 Updated Jul 11, 2024

Kernels & AI inference engine for phones

C++ 3,510 200 Updated Oct 23, 2025

Vision-Language Model Emergency Recognition Evaluation

Python 4 1 Updated May 23, 2025

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 4,796 764 Updated May 12, 2025

OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.

Python 290 23 Updated Apr 29, 2025

You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization

Python 893 163 Updated Oct 28, 2024

Multi-agent framework, runtime and control plane. Built for speed, privacy, and scale.

Python 34,550 4,525 Updated Oct 24, 2025