Youmarizer

Youmarizer is a Python application that extracts and summarizes content from YouTube videos. The application performs the following tasks:

Downloads the video as an audio file.
Processes the audio to transcribe spoken content.
Summarizes the transcribed text.
Extracts key points from the summary.

Features

Download YouTube videos and convert them to audio.
Split audio into manageable chunks for transcription.
Transcribe audio using the Vosk speech recognition model.
Summarize the transcribed text using a transformer-based summarization model.
Extract and display key points from the summary.

Requirements

Python 3.x
Required Python libraries:
- yt-dlp
- pydub
- tqdm
- vosk
- transformers
- colorama

You can install the necessary libraries using pip:

pip install yt-dlp pydub tqdm vosk transformers colorama

Installation

Clone the repository:

git clone https://github.com/yourusername/youmarizer.git

Install ffmpeg: Ensure that ffmpeg is installed on your system. You can download it from FFmpeg’s official website and follow the installation instructions for your operating system.
Download the Vosk model: Download the Vosk model from here and extract it to a directory named vosk-model-small-en-us-0.15 within your project folder.
Install Python dependencies: Make sure you have all the required libraries installed by running:

Usage

Run the application: Execute the main.py script

python main.py
Enter the YouTube video URL when prompted:

Enter the YouTube video URL:

View the results: - The summary will be displayed, and you will be prompted to press Enter to continue. - Key points extracted from the summary will be displayed with bullet points. ``

Project Structure

main.py: The main script to run the application.
audio_processor.py: Contains functions for downloading, splitting, and processing audio.
transcriber.py: Contains functions for speech recognition and transcription.
summarizer.py: Contains functions for text summarization and key point extraction.

Notes

Ensure you have ffmpeg installed for audio processing. You can download it from FFmpeg’s official website.
Adjust the chunk_length_ms parameter in process_audio if you encounter performance issues or need finer control over chunk sizes.

Troubleshooting

Error: ffmpeg not found: Make sure ffmpeg is installed and added to your system’s PATH.
Model not found: Verify that the Vosk model is correctly downloaded and extracted in the specified directory.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
LICENSE		LICENSE
README.md		README.md
audio_processing.py		audio_processing.py
main.py		main.py
speech_recognition.py		speech_recognition.py
text_processing.py		text_processing.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Youmarizer

Features

Requirements

Installation

Usage

Project Structure

Notes

Troubleshooting

License

About

Uh oh!

Releases

Packages

Languages

License

Pradevel/Youmarizer

Folders and files

Latest commit

History

Repository files navigation

Youmarizer

Features

Requirements

Installation

Usage

Project Structure

Notes

Troubleshooting

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages