Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View RuyangFan's full-sized avatar

Block or report RuyangFan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
21 stars written in Jupyter Notebook
Clear filter

A latent text-to-image diffusion model

Jupyter Notebook 72,175 10,561 Updated Jun 18, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,900 4,681 Updated Aug 19, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,245 3,883 Updated Jul 23, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,881 2,553 Updated Mar 13, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,955 989 Updated Jan 11, 2026

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,511 499 Updated Mar 22, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 6,404 412 Updated Jun 28, 2024

[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer

Jupyter Notebook 3,598 448 Updated Oct 25, 2023

Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation" (CVPR 2021) presenting the pixel2style2pixel (pSp) framework

Jupyter Notebook 3,268 577 Updated Oct 1, 2022

Kandinsky 2 — multilingual text2image latent diffusion model

Jupyter Notebook 2,820 316 Updated May 1, 2024

pytorch implementation of openpose including Hand and Body Pose Estimation.

Jupyter Notebook 2,301 417 Updated Jul 9, 2024

[CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer

Jupyter Notebook 1,691 257 Updated Mar 6, 2023

[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"

Jupyter Notebook 1,604 144 Updated Aug 15, 2024

Official PyTorch repo for JoJoGAN: One Shot Face Stylization

Jupyter Notebook 1,440 204 Updated Sep 29, 2022

VOLO: Vision Outlooker for Visual Recognition

Jupyter Notebook 951 95 Updated Sep 18, 2022

This is the PyTorch implementation of paper Real-time Facial Surface Geometry from Monocular Video on Mobile GPUs (https://arxiv.org/pdf/1907.06724.pdf)

Jupyter Notebook 303 63 Updated Jun 12, 2020

Generate broll for a video using AI

Jupyter Notebook 98 23 Updated Jan 11, 2025

Official repository of Manga109Dialog (ICME 2024)

Jupyter Notebook 26 2 Updated Aug 3, 2024

Training code for FAN

Jupyter Notebook 19 4 Updated Apr 11, 2022
Jupyter Notebook 9 1 Updated Oct 11, 2023

This GitHub repository contains image attributes for a dataset of free-use stock photos.

Jupyter Notebook 8 Updated Feb 14, 2023