Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View tiantian0317's full-sized avatar

Block or report tiantian0317

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A project that optimizes Whisper for low latency inference using NVIDIA TensorRT

Python 95 16 Updated Oct 15, 2024

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.

Python 8,068 1,254 Updated Jul 3, 2024

Industry leading face manipulation platform

Python 26,170 4,184 Updated Dec 20, 2025

RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's long sequence processing capabilities.

Python 53 4 Updated Jul 17, 2025

Mamba SSM architecture

Python 16,771 1,542 Updated Nov 11, 2025

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,556 122 Updated Mar 23, 2025

The Crystal Programming Language

Crystal 20,102 1,655 Updated Dec 20, 2025

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,551 286 Updated Dec 19, 2025

Odin Programming Language

Odin 9,415 865 Updated Dec 20, 2025

The Modular Platform (includes MAX & Mojo)

Mojo 25,368 2,742 Updated Dec 20, 2025

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…

Go 40,318 3,242 Updated Dec 20, 2025

🛸 Optimized Video Native Interface - The fastest video editing GPU-accelerated pipeline.

C 15 Updated Nov 19, 2025

A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features 🔥

Python 3,656 276 Updated Nov 12, 2025

Pythonic bindings for FFmpeg's libraries.

Python 3,067 413 Updated Dec 18, 2025

Python library for reading and writing image data

Python 1,671 338 Updated Nov 4, 2025

A summarizer of youtube videos.

Python 44 1 Updated Sep 4, 2025

The cpp inference of BiRefNet based on Tensorrt.

Python 32 2 Updated Jan 17, 2025

BiRefNet Inference using tensorrt

Python 31 3 Updated Aug 30, 2024

Tensorrt implementation for ultra fast face restoration inside ComfyUI

Python 28 4 Updated Sep 22, 2024

Add support for quantization int4 for faster inference.

Python 21 Updated Mar 7, 2025

fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tps,多并发可达60+。

C++ 4,115 415 Updated Dec 4, 2025

Ultra fast dwpose estimation inside comfyui using tensorrt ⚡

Python 47 3 Updated May 3, 2025

Go RPC framework with high-performance and strong-extensibility for building micro-services.

Go 7,777 886 Updated Dec 19, 2025

Benchmark GPU inference performance of MobileNetV2: full-precision vs quantized (INT8) models using TensorRT

Python 1 Updated May 12, 2025

C++ TensorRT implementation of Depth-Anything V1, V2

Python 441 50 Updated Mar 7, 2025

This project provides a high-performance image and video upscaler using [RealESRGAN](https://github.com/xinntao/Real-ESRGAN), accelerated with NVIDIA TensorRT. It supports both 2x and 4x upscaling,…

Python 9 1 Updated May 23, 2025

This is the official repository for Fast-nnUNet, a new fast model inference framework based on the nnUNet framework implementation.

Python 15 2 Updated Nov 3, 2025

InsightFace REST API for easy deployment of face recognition services with TensorRT in Docker.

Python 592 135 Updated Jun 1, 2025

ppocrv5, 以TensorRT-v10版本作为推理引擎

Python 8 Updated Jun 17, 2025

This repository demonstrates how to export a pre-trained ResNet18 model to ONNX, and then convert it to a TensorRT engine for fast inference.

Python 1 Updated Jun 27, 2025
Next