LLM agents built for control. Designed for real-world use. Deployed in minutes.

Python 16,124 1,344 Updated Nov 12, 2025

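A simple collection of technical explainer tutorials, focused on interesting, cutting-edge technical concepts and principles. Each article aims to be readable in under 5 minutes.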

6,326 579 Updated Nov 10, 2025

Curated tutorials and resources for Large Language Models, AI Painting, and more.

4,387 297 Updated Mar 31, 2024

Reinforcement Learning in PyTorch

Python 2,267 330 Updated Jan 4, 2021

MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

Cython 3,085 826 Updated Dec 10, 2023

A collection of reference environments for offline reinforcement learning

Python 1,609 302 Updated Nov 18, 2024

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,767 8,709 Updated Oct 11, 2024

Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games

Python 557 75 Updated Jun 26, 2023

Collection of reinforcement learning algorithms

Python 2,808 565 Updated Jun 17, 2024

Code for conservative Q-learning

Python 462 76 Updated Dec 7, 2021

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,542 416 Updated Oct 22, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,818 6,900 Updated Nov 14, 2025

Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)

Python 60 6 Updated Apr 29, 2024

Papers on Computational Advertising

Python 4,358 1,195 Updated Feb 9, 2021

Advantage weighted Actor Critic for Offline RL

Python 50 8 Updated Aug 27, 2022

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,484 6,139 Updated Sep 18, 2024

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

C++ 2,601 242 Updated Nov 13, 2025

Source code for the X Recommendation Algorithm

Scala 67,769 12,627 Updated Sep 8, 2025

Source code for Twitter's Recommendation Algorithm

Python 10,399 2,232 Updated Jul 10, 2024

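A collection of AI painting resources (platforms available in China and abroad, usage tutorials, parameter tutorials, deployment tutorials, industry news, and more). Covers Stable Diffusion, AnimateDiff, Stable Cascade, Stable SDXL Turbo.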

11,690 948 Updated Aug 14, 2024

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Python 4,682 778 Updated Nov 27, 2024

OpenMMLab Rotated Object Detection Toolbox and Benchmark

Python 2,055 620 Updated Sep 28, 2024

[CVPR2022] DanceTrack: Multiple Object Tracking in Uniform Appearance and Diverse Motion

Python 433 38 Updated Sep 27, 2024

[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.

Python 962 139 Updated Jul 18, 2023
Python 68 11 Updated Oct 23, 2020

A research project for text detection and recognition using PyTorch 1.2.

Python 349 67 Updated Dec 24, 2019

A model compression and acceleration toolbox based on PyTorch.

Python 332 40 Updated Jan 12, 2024

EnsembleMOT: A Step towards Ensemble Learning of Multiple Object Tracking

Python 14 2 Updated Jan 22, 2024

OpenMMLab Rotated Object Detection Toolbox and Benchmark

Python 1 1 Updated Oct 11, 2022