Talking
This repository contains the code for LipGAN, which was published as part of the paper "Towards Automatic Face-to-Face Translation".
🔊 Text-Prompted Generative Audio Model
Source code for the automatic lip-syncing project described in this video! https://www.youtube.com/watch?v=y3B8YqeLCpY
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Simple text-to-phones converter for multiple languages
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
The world's simplest facial recognition API for Python and the command line
Python script to upload videos to YouTube using Selenium
A lightweight and fast short-video processing library based on Node.js
Source code for my React Summit talk 2021 - edited with React!
Use CSS keyframes and animations from animate.css in Remotion.
🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️
A Telegram bot which generates your intro video programmatically 📽️
Fully automated video maker using motion graphics and text-to-speech synthesis to turn newsletters into daily YouTube videos.
Audio visualization components for Remotion.
Programmatic minimalistic audio visualizations.
✨📼Create Reddit Videos with JavaScript📼✨
A fancy Fourier visualizer with RN Skia and Remotion
An attempt to create Fireship Code Report videos in React via Remotion.
Rewind Table is a tool to create programmatic videos using Airtable as a data source and Remotion for animation and rendering
Promotional video generated with Remotion for the 'Become a superhero in ESN' event