Glim - YouTube Video Summarizer

Glim Banner
Glim Demo

    A powerful Node.js application that transforms YouTube videos into well-organized, easy-to-understand summaries. Glim processes videos to extract main topics, generate thought-provoking questions, and provide simple ELI5 (Explain Like I'm 5) answers in a beautifully formatted HTML document.

    Features

    • Extract transcript from YouTube videos
    • Identify main topics from the video content
    • Generate thought-provoking questions for each topic
    • Create simple, easy-to-understand answers using HTML formatting
    • Generate a well-formatted HTML output file
    • PDF export capability for easy sharing and printing
    • Configurable AI provider settings
    • Built on the PocketFlow framework for modular processing

    Prerequisites

    • Node.js (v16 or higher)
    • API key for your preferred AI provider (Google Gemini by default)

    Installation

    1. Clone the repository
    2. Install dependencies:
    npm install

    Usage

    Basic Usage

    Run the application with your API key and provide a YouTube URL:

    GOOGLE_API_KEY="your-api-key" node src/index.js --url "https://www.youtube.com/watch?v=VIDEO_ID"

    Or run it and enter the URL when prompted:

    GOOGLE_API_KEY="your-api-key" node src/index.js

    Additional Options

    Create a default configuration file:

    node src/index.js --config

    Generate a PDF version of the output:

    node src/index.js --url "https://www.youtube.com/watch?v=VIDEO_ID" --pdf

    Select a specific theme:

    node src/index.js --url "https://www.youtube.com/watch?v=VIDEO_ID" --theme material

    Set the output language:

    node src/index.js --url "https://www.youtube.com/watch?v=VIDEO_ID" --lang fr

    Specify an AI provider directly:

    node src/index.js --url "https://www.youtube.com/watch?v=VIDEO_ID" --provider openai --api-key "your-api-key"

    Processing Steps

    When you run Glim, it will:

    1. Extract the video transcript
    2. Identify the main topics
    3. Generate questions for each topic
    4. Create simple ELI5 answers
    5. Generate an HTML output file (and optionally PDF)

    Configuration

    Glim can be configured via a YAML configuration file. Create a default config with:

    node src/index.js --config

    This will create a config.yml file with detailed examples for all providers and the following structure:

    # Glim Configuration File
    # =====================
    
    # API Provider Settings
    # Provider options: google, openai, anthropic, localai
    # See README.md for details on setting up each provider
    
    api:
      provider: google
      key: ""
      model: gemini-2.0-flash
      temperature: 0.7
      max_tokens: 4000
      endpoint: http://localhost:11434/api/generate
      _examples:
        google:
          provider: google
          key: YOUR_GOOGLE_API_KEY
          model: gemini-2.0-flash
        openai:
          provider: openai
          key: YOUR_OPENAI_API_KEY
          model: gpt-4o
        anthropic:
          provider: anthropic
          key: YOUR_ANTHROPIC_API_KEY
          model: claude-3-sonnet-20240229
        localai:
          provider: localai
          endpoint: http://localhost:11434/api/generate
          model: llama3
    
    content:
      maxTopics: 5
      maxQuestionsPerTopic: 3
      codeLang: en
      theme: default
    
    output:
      dirname: output

    You can also set configuration via environment variables, which take precedence over the config file.
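
    For illustration, the precedence could be resolved as in the sketch below. This is a hypothetical helper, not the actual code in src/index.js, and it assumes a YAML parser such as js-yaml:

    // Hypothetical sketch: environment variables win over config.yml values.
    const fs = require('fs');
    const yaml = require('js-yaml'); // assumed YAML parser, not necessarily the one Glim uses

    function resolveApiKey(configPath = 'config.yml') {
      const config = yaml.load(fs.readFileSync(configPath, 'utf8'));
      // An exported GOOGLE_API_KEY takes precedence over api.key from the file.
      return process.env.GOOGLE_API_KEY || config.api.key;
    }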

    Changing AI Provider

    Glim now supports multiple AI providers. You can configure your preferred provider in the config.yml file:

    Google Gemini

    api:
        provider: google
        key: your-api-key
        model: gemini-2.0-flash
    1. Obtain a Google API key from Google AI Studio
    2. Set it in your environment variables: export GOOGLE_API_KEY="your-api-key"
    3. Or update the key field in your config.yml file

    OpenAI

    api:
        provider: openai
        key: your-api-key
        model: gpt-4o
        temperature: 0.7
    1. Install the OpenAI package: npm install openai
    2. Obtain an API key from OpenAI Platform
    3. Set it in your environment variables: export OPENAI_API_KEY="your-api-key"
    4. Or update the key field in your config.yml file

    Anthropic Claude

    api:
        provider: anthropic
        key: your-api-key
        model: claude-3-sonnet-20240229
        max_tokens: 4000
    1. Install the Anthropic package: npm install @anthropic-ai/sdk
    2. Obtain an API key from Anthropic Console
    3. Set it in your environment variables: export ANTHROPIC_API_KEY="your-api-key"
    4. Or update the key field in your config.yml file

    Local LLM (Ollama, LocalAI, etc.)

    api:
        provider: localai
        model: llama3
        endpoint: http://localhost:11434/api/generate
        temperature: 0.7
        max_tokens: 2000
    1. Set up and run a local LLM server like Ollama or LocalAI
    2. Configure the endpoint URL in your config.yml file
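
    For example, with Ollama the model can be pulled and the server started as follows (standard Ollama CLI commands; by default the API is served on http://localhost:11434):

    ollama pull llama3
    ollama serve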

    You can easily extend Glim with additional AI providers by modifying the callLLM.js file.
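
    As a rough sketch (the real callLLM.js may be organized differently, and the (prompt, config) signature is an assumption), adding a provider amounts to one more branch in the provider dispatch:

    // Illustrative provider dispatch for src/utils/callLLM.js; the actual code may differ.
    async function callLLM(prompt, config) {
      switch (config.api.provider) {
        case 'google':     return callGemini(prompt, config);
        case 'openai':     return callOpenAI(prompt, config);
        case 'anthropic':  return callAnthropic(prompt, config);
        case 'localai':    return callLocalAI(prompt, config);
        case 'myprovider': return callMyProvider(prompt, config); // hypothetical new provider
        default:
          throw new Error(`Unknown provider: ${config.api.provider}`);
      }
    }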

    Customizing Content Generation

    You can adjust the following parameters in your config.yml:

    • maxTopics: Maximum number of topics to extract (default: 5)
    • maxQuestionsPerTopic: Maximum questions per topic (default: 3)
    • codeLang: Language code for transcript extraction (default: 'en')

    Output Customization

    • dirname: Name of the output directory (default: 'output')
    • Use the --pdf flag to also generate a PDF version
    • Use the --theme <theme> flag to select an output theme (default: 'default')
    • Use the --lang <lang> flag to specify the output language (default: 'en')
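
    These flags can be combined in a single run:

    node src/index.js --url "https://www.youtube.com/watch?v=VIDEO_ID" --theme material --lang fr --pdf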

    HTML Themes

    Glim offers several beautifully crafted themes for your video summaries:

    1. Default - Clean, minimalist design with soft shadows and modern typography
    2. Neomorphism - Soft UI design with subtle shadows and clean interfaces
    3. Glassmorphism - Modern glass-like transparent UI elements
    4. RetroY2K - Nostalgic design inspired by early 2000s web aesthetics
    5. Hacker - Terminal-style dark theme with monospace fonts
    6. Typography - Focus on beautiful typography with perfect readability
    7. Maximalist - Bold, vibrant design with decorative elements
    8. Brutalist - Raw, bold design with high contrast and visible construction
    9. Flat - Clean, minimalist design without shadows or depth
    10. Minimalist - Ultra-simple design focused on content
    11. Material - Follows Material Design principles with cards, elevation, and a color system

    Select a theme using the --theme flag:

    node src/index.js --url "https://www.youtube.com/watch?v=VIDEO_ID" --theme brutalist

    Example outputs of different themes can be found in the output directory.

    Project Structure

    • src/index.js - Main application entry point
    • src/flow.js - Flow creation and orchestration using PocketFlow
    • src/pocketflow.js - PocketFlow framework integration
    • src/nodes.js - Processing nodes for the video summarization pipeline
    • src/utils/ - Utility functions:
      • callLLM.js - Multi-provider LLM API interaction
      • youtubeProcessor.js - YouTube video processing
      • htmlGenerator.js - HTML output generation
      • logger.js - Enhanced logging functionality
      • pdfConvertor.js - PDF conversion utilities

    Component Architecture

    The following diagram illustrates the components and their relationships in the Glim application:

    classDiagram
        class GlimApplication {
            +main()
            +processVideo(url)
            +generateSummary()
            +outputResults()
        }
    
        class ConfigManager {
            +loadConfig()
            +getAPISettings()
            +getOutputSettings()
        }
    
        class YouTubeProcessor {
            +extractVideoId(url)
            +getVideoInfo(videoId)
            +getTranscript(videoId)
        }
    
        class LLMProvider {
            +callLLM(prompt)
            +callGemini(prompt)
            +callOpenAI(prompt)
            +callAnthropic(prompt)
            +callLocalAI(prompt)
        }
    
        class HTMLGenerator {
            +htmlGenerator(title, imageUrl, sections)
            +generateTOC()
        }
    
        class PDFConverter {
            +convertToPDF(html)
        }
    
        class Logger {
            +info(message)
            +error(message)
            +warn(message)
            +debug(message)
        }
    
        GlimApplication --> ConfigManager : uses
        GlimApplication --> YouTubeProcessor : uses
        GlimApplication --> LLMProvider : uses
        GlimApplication --> HTMLGenerator : uses
        GlimApplication --> PDFConverter : uses
        GlimApplication --> Logger : uses
    
        LLMProvider ..> ConfigManager : reads config
        YouTubeProcessor ..> Logger : logs events
        LLMProvider ..> Logger : logs events
    

    How Glim Uses PocketFlow

    Glim implements a video summarization pipeline using PocketFlow's node-based architecture:

    Glim PocketFlow Architecture

    Workflow Diagram

    flowchart TD
        A[YouTube URL] -->|Extract| B[Video Transcript]
        B -->|Analyze| C[Topic Detection]
        C -->|Generate| D[Questions]
        D -->|Answer| E[ELI5 Answers]
        E -->|Format| F[Generate HTML]
        F -->|Optional| G[Convert to PDF]
    
        subgraph AIProviders[AI Providers]
            P1[Google Gemini]
            P2[OpenAI]
            P3[Anthropic Claude]
            P4[LocalAI]
        end
    
        AIProviders -.->|Selected Provider| C
        AIProviders -.->|Selected Provider| D
        AIProviders -.->|Selected Provider| E
    
        subgraph Config[Configuration]
            C1[config.yml]
            C2[Environment Variables]
        end
    
        Config -.->|Settings| AIProviders
    
        style A fill:#ffcccb,stroke:#333,stroke-width:2px
        style B fill:#c2e0ff,stroke:#333,stroke-width:2px
        style C fill:#d4f7c9,stroke:#333,stroke-width:2px
        style D fill:#ffecb3,stroke:#333,stroke-width:2px
        style E fill:#e1bee7,stroke:#333,stroke-width:2px
        style F fill:#b2dfdb,stroke:#333,stroke-width:2px
        style G fill:#b2dfdb,stroke:#333,stroke-width:2px,stroke-dasharray: 5 5
    

    Processing Nodes

    1. Input Node: Handles YouTube URL validation and processing
    2. Extraction Node: Extracts video transcript using YouTube API
    3. Analysis Node: Sends content to the selected LLM provider for topic extraction
    4. Question Generation Node: Creates questions for each identified topic
    5. Answer Generation Node: Produces ELI5 answers for each question
    6. Output Node: Formats results into HTML/PDF using templates

    This modular approach makes it easy to:

    • Swap LLM providers without changing the overall flow
    • Add new processing steps (e.g., translation, keyword extraction)
    • Customize output formats while reusing the core processing logic
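
    To make the node-based idea concrete, the sketch below shows the pattern in schematic form. It is not PocketFlow's actual API, and the step functions are simple placeholders; it only illustrates how small, independent nodes can be chained into the pipeline described above:

    // Schematic node-and-chain pattern (illustration only, not PocketFlow's actual API).
    class Step {
      constructor(fn) { this.fn = fn; this.next = null; }
      then(step) { this.next = step; return step; }   // chain steps together
      async run(shared) {                             // each step reads/writes a shared store
        await this.fn(shared);
        if (this.next) await this.next.run(shared);
      }
    }

    // Placeholder implementations standing in for the real extraction/analysis nodes.
    const fetchTranscript = async (url) => `transcript of ${url}`;
    const findTopics = async (text) => ['topic 1', 'topic 2'];

    const extract = new Step(async (s) => { s.transcript = await fetchTranscript(s.url); });
    const topics  = new Step(async (s) => { s.topics = await findTopics(s.transcript); });
    extract.then(topics);
    // extract.run({ url: '...' }) walks the chain, sharing state between steps.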

    Sequence Diagram

    The following diagram illustrates the execution sequence and data flow between components:

    sequenceDiagram
        participant User
        participant CLI as Command Line Interface
        participant Config as Configuration Manager
        participant YT as YouTube Processor
        participant LLM as LLM Provider
        participant Generator as HTML Generator
        participant PDF as PDF Converter
    
        User->>CLI: Inputs YouTube URL
        CLI->>Config: Load configuration
        Config-->>CLI: Provider settings
        CLI->>YT: Process video
        YT->>YT: Extract transcript
        YT-->>CLI: Video info & transcript
        CLI->>LLM: Request topic analysis
        LLM-->>CLI: Main topics
    
        loop For each topic
            CLI->>LLM: Generate questions
            LLM-->>CLI: Topic questions
            CLI->>LLM: Generate ELI5 answers
            LLM-->>CLI: Simple answers
        end
    
        CLI->>Generator: Create HTML summary
        Generator-->>CLI: HTML document
    
        alt PDF Output Requested
            CLI->>PDF: Convert HTML to PDF
            PDF-->>CLI: PDF document
        end
    
        CLI-->>User: Output files
    

    About PocketFlow Framework

    Pocket Flow – 100-line minimalist LLM framework

    Glim is built on PocketFlow, a minimalist 100-line LLM framework designed for building modular, efficient processing flows. PocketFlow provides:

    • Lightweight Architecture: Just 100 lines of core code with zero dependencies and no vendor lock-in
    • Flexible Graph-Based Processing: Build complex data processing workflows with simple, reusable nodes
    • Rich Design Patterns: Support for Agents, Multi-Agent Systems, Workflows, RAG (Retrieval Augmented Generation), and more
    • Error Handling: Built-in error management and automatic retries
    • Extensibility: Easy to customize and extend with your own processing nodes

    PocketFlow's minimalist approach perfectly complements Glim's video summarization pipeline, allowing for:

    • Modular processing of YouTube video content
    • Flexible integration of multiple LLM providers
    • Easy extension for new features and output formats

    Learn more about the PocketFlow framework at: https://github.com/The-Pocket/PocketFlow

    Development

    Want to modify or extend Glim? Here's how to set up your development environment:

    1. Clone the repository:

      git clone https://github.com/yourusername/glim.git
      cd glim
    2. Install dependencies:

      npm install
    3. Create a config file with your API key:

      node src/index.js --config

      Then edit the config.yml file to add your API key.

    4. Make your changes to the codebase.

    5. Test your changes:

      GOOGLE_API_KEY="your-api-key" node src/index.js --url "https://www.youtube.com/watch?v=VIDEO_ID"

    Adding a New AI Provider

    To add support for a different AI model or provider:

    1. Create a new file in src/utils/ for your provider, for example openAIProvider.js.
    2. Implement the API call function similar to the existing callLLM.js.
    3. Update the config.js file to include your new provider in the options.
    4. Modify the code in the nodes that call the LLM to use your new provider when selected.
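
    As a starting point for steps 1 and 2, a provider module might look like the sketch below (illustrative only; adapt it to whatever interface callLLM.js expects). It uses the official openai Node.js SDK:

    // src/utils/openAIProvider.js - illustrative sketch; match the existing callLLM.js interface.
    const OpenAI = require('openai'); // requires: npm install openai

    async function callOpenAI(prompt, config) {
      const client = new OpenAI({ apiKey: config.api.key });
      const response = await client.chat.completions.create({
        model: config.api.model || 'gpt-4o',
        messages: [{ role: 'user', content: prompt }],
        temperature: config.api.temperature,
        max_tokens: config.api.max_tokens,
      });
      return response.choices[0].message.content;
    }

    module.exports = { callOpenAI };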

    Troubleshooting

    API Key Issues

    • Error: "API key not found"
      • Solution: Make sure your API key is set in the environment variable or in the config.yml file.

    YouTube Video Access

    • Error: "Could not extract video transcript"
      • Solution: Make sure the video is public and has captions/subtitles available.

    PDF Generation

    • Error: "Failed to write output file as pdf"
      • Solution: Ensure Puppeteer is installed correctly and that enough system resources are available.

    Theme Issues

    • Error: "Theme not found" or "Rendering error"
      • Solution: Verify the theme name is correct and matches one of the available options.

    License

    MIT


    Made with ❤️ by Bagi

    Glim Logo