- Edmonton, Alberta, Canada
- https://mahmoudm007.github.io/
- in/mahmoud-m007
- mahmoud_m007
- @mahmoud_m007
Highlights
- Pro
Stars
This repository contains the Hugging Face Agents Course.
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Easily find and view pre-trained AI models and deep learning projects through the command line π»
GLM-4 series: Open Multilingual Multimodal Chat LMs | εΌζΊε€θ―θ¨ε€ζ¨‘ζε―Ήθ―樑ε
A reactive notebook for Python β run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
RT-GENE: Real-Time Eye Gaze and Blink Estimation in Natural Environments
A collection of papers on Diffusion for Image-to-Image Translation and Style Transfer
all of the workflows of n8n i could find (also from the site itself)
verl: Volcano Engine Reinforcement Learning for LLMs
SQL Native Memory Layer for LLMs, AI Agents & Multi-Agent Systems
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor
StyleFlow: Attribute-conditioned Exploration of StyleGAN-generated Images using Conditional Continuous Normalizing Flows (ACM TOG 2021)
Hands-on tutorials and code to learn 3D data processing, point clouds, and deep learning for computer vision.
Dialectical reasoning architecture for LLMs (Thesis β Antithesis β Synthesis)
GPT4V-level open-source multi-modal model based on Llama3-8B
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that shoβ¦
Tool for robust segmentation of >100 important anatomical structures in CT and MR images
Soft Robotics Materials Database
Code for the Gaze360: Physically Unconstrained Gaze Estimation in the Wild Dataset
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
A list of developer portfolios for your inspiration
Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"
Catalogue of portals that maps out roadmap for self learners