tkpham3105

PHAM Trung Kien tkpham3105

AI/CV PhD @ HKUST

16 followers · 19 following

HKUST
Hong Kong | Vietnam
https://tkpham3105.github.io/

Achievements

Stars

HKUST-LongGroup / STAMP

Better, Stronger, Faster: Tackling the Trilemma in MLLM-based Segmentation with Simultaneous Textual Mask Prediction

Python 32 Updated Jan 6, 2026

visual-gen / semanticist

(ICCV 2025) "Principal Components" Enable A New Language of Images

Jupyter Notebook 78 6 Updated Jul 28, 2025

facebookresearch / nwm

Official code for the CVPR 2025 paper "Navigation World Models".

Python 518 46 Updated Nov 24, 2025

dlrudco / Fast-Audioset-Download

Download audioset data super fastly with youtube-dl, ffmpeg and python multiprocessing

Python 44 Updated Aug 1, 2024

TransDiff / TransDiff

Jupyter Notebook 144 8 Updated Jun 20, 2025

littlewhitesea / training-free-methods

This is a repository to collect training-free algorithms for visual generation and manipulation

200 7 Updated Jan 13, 2026

guoqincode / DiT-Visualization

Visualization of DiT self attention features

Python 235 10 Updated Aug 12, 2024

AssafSinger94 / dino-tracker

Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)

Python 548 53 Updated Nov 23, 2024

End2End-Diffusion / REPA-E

[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers

Python 446 21 Updated Dec 6, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,209 2,338 Updated Dec 15, 2025

mayu-ot / ltsim

Python 14 3 Updated Mar 24, 2025

VectorSpaceLab / OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 4,301 370 Updated Dec 4, 2025

frank-xwang / InstanceDiffusion

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

Python 607 32 Updated Jun 17, 2025

VideoVerses / VideoTuna

Let's finetune video generation models!

Python 533 29 Updated Sep 15, 2025

EmilianPostolache / stable-audio-controlnet

Fine-tune Stable Audio Open with DiT ControlNet.

Python 249 9 Updated May 16, 2025

TonyLianLong / igligen

Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation

Python 46 8 Updated Jun 1, 2024

v-iashin / Synchformer

Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024)

Python 104 9 Updated Sep 15, 2025

Sreyan88 / GAMA

Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Python 150 13 Updated Dec 5, 2024

ollama / ollama

Get up and running with OpenAI GLM-4.7, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 160,669 14,292 Updated Jan 26, 2026

TonyLianLong / LLM-groundedVideoDiffusion

[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper

Python 164 8 Updated May 7, 2024

lzhangbj / ASVA

[ECCV 2024 Oral] Audio-Synchronized Visual Animation

Python 57 1 Updated Sep 12, 2024

Alpha-VLLM / Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,248 94 Updated Feb 16, 2025

tkpham3105 / TALE

[ACM MM 2024] Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization

Python 20 1 Updated Dec 15, 2024

CASIA-IVA-Lab / VAST

[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Jupyter Notebook 297 18 Updated Mar 14, 2024

haoheliu / AudioLDM2

Text-to-Audio/Music Generation

Python 2,573 205 Updated Sep 29, 2024

forworksmk / forworksmk.github.io

HTML 1 Updated Mar 7, 2024

ChenHsing / Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

2,263 113 Updated Jun 27, 2025

Vchitect / Latte

[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.

Python 1,908 190 Updated Oct 30, 2025

magic-research / magic-animate

[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"

Python 10,898 1,100 Updated Aug 29, 2025

PixArt-alpha / PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,894 95 Updated Oct 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PHAM Trung Kien tkpham3105

Achievements

Achievements

Block or report tkpham3105

Stars

HKUST-LongGroup / STAMP

visual-gen / semanticist

facebookresearch / nwm

dlrudco / Fast-Audioset-Download

TransDiff / TransDiff

littlewhitesea / training-free-methods

guoqincode / DiT-Visualization

AssafSinger94 / dino-tracker

End2End-Diffusion / REPA-E

Wan-Video / Wan2.1

mayu-ot / ltsim

VectorSpaceLab / OmniGen

frank-xwang / InstanceDiffusion

VideoVerses / VideoTuna

EmilianPostolache / stable-audio-controlnet

TonyLianLong / igligen

v-iashin / Synchformer

Sreyan88 / GAMA

ollama / ollama

TonyLianLong / LLM-groundedVideoDiffusion

lzhangbj / ASVA

Alpha-VLLM / Lumina-T2X

tkpham3105 / TALE

CASIA-IVA-Lab / VAST

haoheliu / AudioLDM2

forworksmk / forworksmk.github.io

ChenHsing / Awesome-Video-Diffusion-Models

Vchitect / Latte

magic-research / magic-animate

PixArt-alpha / PixArt-sigma