Thanks to visit codestin.com
Credit goes to github.com

Skip to content
#

rtx-5090

Here are 40 public repositories matching this topic...

From-scratch C++/CUDA inference engine for the NVIDIA RTX 5090 (sm_120a) — the best single-GPU backend for agentic AI: tool calling, long-context loops, reasoning and concurrent sub-agents on top of the fastest single-stream decode on the 5090 (beats llama.cpp, at-or-ahead of vLLM on NVFP4). 100% written by Claude Code.

  • Updated Jul 4, 2026
  • Cuda

异环(Neverness To Everness / Ananta)光线追踪一键部署面板,基于 OptiScaler winmm 方案,默认推荐 RTX 5090,并支持本机/RTX 4090/RTX 5080M 配置、备份、恢复和本地 WebUI。

  • Updated May 18, 2026
  • Python

Local AI coding assistant using Qwen3.6-27B, Ollama, and FastAPI proxy. Built for NVIDIA DGX Spark (GB10) with RTX 5090/4090/3090 GPU support. Powers VS Code Copilot or GitHub Copilot CLI with zero API costs.

  • Updated Jul 4, 2026
  • HTML

Production-grade Traditional Chinese / Taiwan Mandarin speech-to-text. Qwen3-ASR + MediaTek Breeze-ASR-25, hot-word injection, LLM polish, speaker diarization. RTF up to 1554x on RTX 5090, 56 TDD tests.

  • Updated May 7, 2026
  • Python

Improve this page

Add a description, image, and links to the rtx-5090 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the rtx-5090 topic, visit your repo's landing page and select "manage topics."

Learn more