- Lahore, Pakistan
- llcuda.github.io
- in/mohammad-waqas-3a1384270
- @waqasm86
Lists (3)
Sort Name ascending (A-Z)
Stars
- All languages
- C
- C#
- C++
- CMake
- CSS
- Clojure
- Cuda
- Cython
- Dockerfile
- Elixir
- Go
- Go Template
- Groovy
- HCL
- HTML
- Handlebars
- Haskell
- Java
- JavaScript
- Jinja
- Jupyter Notebook
- Lua
- MDX
- MLIR
- Makefile
- Markdown
- Mustache
- PHP
- PLpgSQL
- PowerShell
- Pug
- Python
- R
- Ruby
- Rust
- SCSS
- Scala
- Shell
- Smarty
- Starlark
- Svelte
- Swift
- TSQL
- TypeScript
- VHDL
- Vue
- YAML
CUDA inference backend for Unsloth - Tesla T4 optimized with FlashAttention, Tensor Cores, and native Python API
GitHub Copilot CLI brings the power of Copilot coding agent directly to your terminal.
Gemini auth plugin for opencode
2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.
Streamlit Component for rendering Folium maps
An ensemble approach to accurately detect somatic mutations using SomaticSeq
LLMRouter: An Open-Source Library for LLM Routing
Svelte Kit Tutorial with Remote Functions, Async, Drizzle, Better Auth
Intelligent textbook for a 15 week high-school physics course.
How to use AU and mkdocs material with education theory to create intelligent textbooks
NVIDIA curated collection of educational resources related to general purpose GPU programming.
Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.
A high-performance, asynchronous toolkit for building MCP servers and clients in Rust.
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
Client application of the UXI Group Studies infrastructure for conducting user studies with eye tracking.
tunguz / tiny-cuda-nn
Forked from NVlabs/tiny-cuda-nnLightning fast C++/CUDA neural network framework
NVIDIA Networking NIC Configuration Operator For Kubernetes
An open source flash player implementation