RogerChern

Organizations

@MiroMindAI

MiroThinker is a series of open-source agentic models trained for deep research and complex tool use scenarios.

Python 1,368 94 Updated Dec 23, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,468 1,999 Updated Nov 1, 2025

Undetected Python version of the Playwright testing and automation library.

Python 1,032 71 Updated Dec 12, 2025

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 1,884 144 Updated Apr 14, 2025

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,936 140 Updated Dec 6, 2024

Hardware-synchronized device for FAST-LIVO (Handheld & UAV).

C 698 105 Updated May 19, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,914 94 Updated Aug 15, 2024

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 984 36 Updated Nov 25, 2025

A keyboard shortcut browser extension for keyboard-based navigation and tab operations with an advanced omnibar

TypeScript 4,186 289 Updated Apr 14, 2025

Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop Streaming Platform for Self-Hosting, Containers, Kubernetes, or Cloud/HPC

Python 1,258 95 Updated Dec 24, 2025

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,288 361 Updated Nov 27, 2025

NVIDIA NCCL Tests for Distributed Training

Shell 129 25 Updated Dec 22, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 19,011 1,301 Updated Oct 21, 2025

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Python 12,099 1,071 Updated Oct 29, 2025

Utils for streaming large files (S3, HDFS, gzip, bz2...)

Python 3,420 386 Updated Dec 1, 2025

SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.

Python 2,396 406 Updated Dec 20, 2025

Code for a series of work in LiDAR perception, including SST (CVPR 22), FSD (NeurIPS 22), FSD++ (TPAMI 23), FSDv2, and CTRL (ICCV 23, oral).

Python 868 104 Updated Dec 22, 2024

A unified framework for robot learning

Python 607 95 Updated Nov 26, 2024

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook 2,711 240 Updated Aug 15, 2025

DeepSeek LLM: Let there be answers

Makefile 6,676 1,043 Updated Feb 4, 2024

Dobb·E: An open-source, general framework for learning household robotic manipulation

G-code 607 55 Updated Oct 15, 2024

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 71,296 8,813 Updated Oct 21, 2025

DeepSeek Coder: Let the Code Write Itself

Python 22,540 2,690 Updated Nov 11, 2025

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Jupyter Notebook 1,075 77 Updated Mar 25, 2023

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Jupyter Notebook 3,093 287 Updated May 3, 2024

Prompts for GPT-4V & DALL-E3 to fully utilize their multi-modal abilities. GPT4V Prompts, DALL-E3 Prompts.

272 24 Updated Aug 18, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,055 1,098 Updated Dec 23, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,840 582 Updated May 3, 2024

Open-Set Grounded Text-to-Image Generation

Python 2,186 165 Updated Mar 6, 2024