Starred repositories

LinkedIn Profile Data Extraction Tool: extracts data from a LinkedIn profile when a user accesses or reviews it

JavaScript 2 1 Updated Feb 14, 2026

A curated list of awesome skills, hooks, slash-commands, agent orchestrators, applications, and plugins for Claude Code by Anthropic

Python 23,772 1,389 Updated Feb 15, 2026

CUDA Core Compute Libraries

C++ 2,172 343 Updated Feb 15, 2026

NVIDIA tools guide

Cuda 159 10 Updated Jan 7, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,486 985 Updated Feb 6, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,221 1,697 Updated Dec 17, 2025

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 1,925 116 Updated Feb 13, 2026

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

75,125 8,659 Updated Feb 5, 2026

Easiest and laziest way to build multi-agent LLM applications.

Python 3,725 361 Updated Feb 14, 2026

What would you do with 1000 H100s...

Jupyter Notebook 1,153 70 Updated Jan 10, 2024

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,492 319 Updated Feb 15, 2026

Best practices & guides on how to write distributed pytorch training code

Python 577 65 Updated Oct 22, 2025

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

Go 5,110 1,368 Updated Feb 15, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,281 1,683 Updated Feb 14, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,885 2,100 Updated Feb 15, 2026

Stack trace visualizer

Perl 19,256 2,088 Updated Oct 20, 2024

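AIInfra (AI infrastructure) covers the AI system stack, from underlying hardware such as chips up to the upper software stack, in support of large AI model training and inference.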

Jupyter Notebook 6,082 829 Updated Dec 22, 2025

Fast CUDA matrix multiplication from scratch

Cuda 1,052 162 Updated Sep 2, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

10,374 776 Updated Jan 21, 2026

Download web video and audio

C# 5,074 233 Updated Feb 15, 2026

High-Resolution 3D Human Digitization from A Single Image.

Python 9,759 1,480 Updated Aug 19, 2024

End-to-End Object Detection with Transformers

Python 15,110 2,663 Updated Mar 12, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,624 4,722 Updated Feb 13, 2026

Implementation of Stable Diffusion with PyTorch

Jupyter Notebook 361 22 Updated Feb 22, 2025

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 749 103 Updated Feb 15, 2026

A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS

252 12 Updated May 6, 2025

A repository to unravel the language of GPUs, making their kernel conversations easy to understand

Python 201 8 Updated Jun 1, 2025

Witness the aha moment of VLM with less than $3.

Python 4,032 285 Updated May 19, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,888 899 Updated Feb 15, 2026

Fast low-bit matmul kernels in Triton

Python 430 31 Updated Feb 1, 2026