wanghqc
  • Qualcomm
  • San Diego, CA, USA

Starred repositories

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 191,404 32,629 Updated Feb 13, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,272 1,679 Updated Feb 13, 2026

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

C 8,842 2,267 Updated Jan 6, 2026

MLX: An array framework for Apple silicon

C++ 23,914 1,511 Updated Feb 12, 2026

Distributed MoE in a Single Kernel [NeurIPS '25]

Cuda 193 21 Updated Feb 7, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,765 2,034 Updated Jan 13, 2026

LLM inference in C/C++

C++ 20 4 Updated Oct 22, 2025

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 5,525 893 Updated May 12, 2025

Beignet is an open source implementation of the OpenCL specification, a generic compute-oriented API. This is the Beignet source code mirror on GitHub. This is a publish-only repository and all pull r…

C++ 101 40 Updated Jan 7, 2023

Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver

C++ 41 43 Updated Feb 12, 2026

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

C++ 4,891 448 Updated Jan 19, 2026

Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

Python 918 158 Updated Feb 12, 2026

pocl - Portable Computing Language

C 1,050 283 Updated Feb 12, 2026

LM Studio CLI

TypeScript 4,188 331 Updated Feb 13, 2026

Microsoft Automatic Mixed Precision Library

Python 636 49 Updated Dec 1, 2025

Print all known information about all available OpenCL platforms and devices in the system

C 371 84 Updated Dec 19, 2025

A comprehensive 10-page probability cheatsheet that covers a semester's worth of introduction to probability.

TeX 3,139 701 Updated Jun 15, 2022

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,200 2,700 Updated Nov 3, 2025

Lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,731 598 Updated Feb 13, 2026

Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)

337 76 Updated May 28, 2023

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,316 79 Updated Mar 6, 2025

A C++ GPU Computing Library for OpenCL

C++ 1,645 340 Updated Feb 6, 2026

Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

Python 18,963 1,867 Updated Jul 15, 2025

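Llama Chinese community: a real-time roundup of the latest Llama learning resources, building the best open-source ecosystem for Chinese Llama large models. Fully open source and available for commercial use.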

Python 14,746 1,307 Updated Apr 6, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 46,975 6,815 Updated Feb 13, 2026

Inference Llama 2 in one file of pure C

C 19,168 2,444 Updated Aug 6, 2024

Inference code for Llama models

Python 59,138 9,825 Updated Jan 26, 2025

A curated list of awesome computer vision resources

23,053 4,431 Updated May 17, 2024

LLM inference in C/C++

C++ 94,967 14,889 Updated Feb 13, 2026