Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Ben…

Jupyter Notebook 13 1 Updated May 21, 2025

Fantasy-AMAP / fantasy-talking

[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Python 1,577 123 Updated Aug 20, 2025

KwaiVGI / ReCamMaster

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,556 72 Updated Oct 23, 2025

dokkaner / teemii

A versatile, self-hosted manga reader and manager with extensible agent-based metadata retrieval

JavaScript 430 39 Updated Mar 8, 2024

changsn / SparseDiT

NeurIPS 2025

Python 9 1 Updated Sep 24, 2025

NirDiamant / RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 22,597 2,553 Updated Oct 8, 2025

lllyasviel / FramePack

Lets make video diffusion practical!

Python 16,029 1,532 Updated Oct 16, 2025

hkchengrex / MMAudio

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 1,915 222 Updated Sep 24, 2025

ali-vilab / VACE

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,369 243 Updated Oct 17, 2025

neosr-project / neosr

neosr is an open-source framework for training super-resolution models.

Python 280 44 Updated Jun 2, 2025

coder / code-server

VS Code in the browser

TypeScript 74,421 6,306 Updated Oct 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vlad Petrenko HarewVlad

Achievements

Achievements

Block or report HarewVlad

Stars

PatrickJS / awesome-cursorrules

Agent-on-the-Fly / Memento

danakt / russian-words

voicekit-team / T-one

Textualize / rich

stanfordnlp / dspy

sgl-project / sglang

speaches-ai / speaches

magenta / magenta-realtime

aigc-apps / VideoX-Fun

Zehong-Ma / MagCache

openai / openai-agents-python

NoM0Re / WoW-3.3.5a-Addons

hao-ai-lab / FastVideo

bytedance / ATI

yl4579 / PitchExtractor

yl4579 / AuxiliaryASR

huggingface / parler-tts

getzep / graphiti

MahtaFetrat / LLM-Powered-G2P