-
German Center for Open Source AI @gc-os-ai
- Islamabad, Pakistan
-
16:36
(UTC +05:00) - https://orcid.org/0009-0004-3476-2772
- in/armaghan-shakir
- armaghan_shakir
- @armaghan_shakir
- https://www.kaggle.com/sacrum
Highlights
- Pro
Stars
Intelligent automation and multi-agent orchestration for Claude Code
Exploring different features of MCP servers and clients
An Open Source Toolkit For LLM Distillation
Tools for merging pretrained large language models.
Official code for the paper: Depth Anything At Any Condition
Context engineering is the new vibe coding - it's the way to actually make AI coding assistants work. Claude Code is the best for this so that's what this repo is centered around, but you can apply…
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
Official implementation for "SlimDoc: Lightweight Distillation of Document Transformer Models," published in the International Journal on Document Analysis and Recognition (IJDAR), 2025
Convert OCR results into textual representations. The resulting verbalizations can be used as input to an LLM for automated document understanding. Code for the Paper "LAPDoc: Layout-Aware Promptin…
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]
An open-source AI agent that brings the power of Gemini directly into your terminal.
OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871
Model Context Protocol Servers
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Demo of a customer service use case implemented with the OpenAI Agents SDK
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
[CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"
[CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"
PyTorch code and models for VJEPA2 self-supervised learning from video.
100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more.
Lightweight prompt injection detection for Python. Fast, easy integration with LangChain and AutoGen.
Simple UI for debugging correlations of text embeddings
LBM: Latent Bridge Matching for Fast Image-to-Image Translation ✨ (ICCV 2025 Highlight)
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
The simplest, fastest repository for training/finetuning small-sized VLMs.
Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)