Thanks to visit codestin.com
Credit goes to github.com

Skip to content

A Firefox and Google Chrome extension to clip websites and download them into a readable and LLM-friendly markdown file.

License

Notifications You must be signed in to change notification settings

phunterlau/llm.md

 
 

Repository files navigation

MarkDownload - Markdown Web Clipper

GitHub release (latest SemVer)

This is an extension to clip websites and download them into a readable markdown file. Please keep in mind that it is not guaranteed to work on all websites.

To use this add-on, simply click the add-on icon while you are browsing the page you want to save offline. A popup will show the rendered markdown so you can make minor edits or copy the text, or you can click the download button to download an .md file.
Selecting text will allow you to download just the selected text

✨ LLM Optimization Feature

MarkDownload now includes an LLM Optimization feature designed to enhance markdown output for AI and Large Language Model processing:

Features:

  • Structured YAML Frontmatter: Automatically adds metadata including title, URL, date, author, excerpt, and tags
  • Clean Links: Removes tracking parameters (utm_*, fbclid, gclid) for cleaner URLs
  • Enhanced Content Structure: Optimizes formatting and whitespace for better AI parsing, including special handling for arXiv's verbose MathML annotations.
  • Conditional Application: Enable/disable via the settings page

How to Enable:

  1. Click the extension icon and then the gear icon to open settings
  2. Check "Optimize Markdown for LLM" option
  3. Save settings

When enabled, your markdown files will include structured metadata and cleaned content optimized for AI processing workflows.

🔧 Manifest V3 Migration

This extension has been updated to use Manifest V3, the latest extension standard for Chrome and Edge browsers:

Technical Improvements:

  • Modern Service Worker Architecture: Replaced background scripts with service workers for better performance and security
  • Offscreen Document Support: DOM operations now use offscreen documents for compatibility with service worker limitations
  • Updated APIs: All deprecated APIs have been updated to their modern equivalents
  • Enhanced Security: Improved permissions model and content security policies
  • Future-Proof: Ensures compatibility with current and future browser versions

For Developers:

  • Background scripts migrated to background-v3.js service worker
  • DOM operations moved to offscreen document (src/offscreen/)
  • Updated from tabs.executeScript to scripting.executeScript
  • Added proper error handling and timeout protection
  • Maintained backward compatibility with all existing features

See the Markdownload User Guide for more details on the functionality of this extension

Installation

The extension is available for Firefox, Google Chrome, Microsoft Edge and Safari.

iTunes App Store

Obsidian Integration

For integration with obsidian, you need to install and enable community plugins named "Advanced Obsidian URI". This plugin help us to bypass character limitation in URL. Because it's using clipboard as the source for creating new file. More information about Advanced Obsidian URI plugin: https://vinzent03.github.io/obsidian-advanced-uri/

You need to do some configurations in order to use this integration.

Steps to follow
  1. Left-Click on the extension
  2. Click on the gear icon to open the configuration menu
  3. Scroll down to "Obsidian integration" section and turn "enable obsidian integration" on.
  4. Fill out the form below (Obsidian vault name and Obsidian folder name.)
  5. Right-click on the extension and open the menu
  6. In "MarkDownload - Markdown Web Clipper", select "Send Tab to Obsidian"

External Libraries

It uses the following libraries:

  • Readability.js by Mozilla version 0.5.0. This library is also used for the Firefox Reader View and it simplifies the page so that only the important parts are clipped. (Licensed under Apache License Version 2.0)
  • Turndown by Dom Christie in version 7.1.3 is used to convert the simplified HTML (from Readability.js) into markdown. (Licensed under MIT License)
  • Moment.js version 2.29.4 used to format dates in template variables

Permissions

  • Data on all sites: used to enable "Download All Tabs" functionality - no other data is captured or sent online
  • Access tabs: used to access the website content when the icon in the browser bar is clicked.
  • Manage Downloads: necessary to be able to download the markdown file.
  • Storage: used to save extension options
  • Clipboard: used to copy Markdown to clipboard

The Common Mark icon courtesy of https://github.com/dcurtis/markdown-mark

Pricing

This is an open-source extension I made for fun. Its intention is to be completely free. It's free on Firefox, Edge and Chrome (and other Chromium browsers), but unfortunately for Safari there is a yearly developer fee, so I've decided to charge a small price for the Safari version to help cover that cost. Alternately, you can become a GitHub Sponsor for as little as $2 per month and you can request a key for the Safari version. Also, even if you're using the free version and you absolutely have to send me money because you like it that much, feel free to throw some coins in my hat via the following:

GitHub Sponsors Paypal

Version History

3.4.0

  • Fixed extra spaces in titles which could cause issues (thanks @rickdoesdev !)
  • Fixed an issue with image paths in some circumstances (thanks @rickdoesdev !)
  • Added parametersizations for "mixed-kebab" and "mixed_snake" which retain original casing but replace spaces (thanks @NSHenry !)
    • Also added a special "obsidian-cal" parameterization which is the same as "mixed-kebab" with duplicate - removed for additional compatibility with the Obsidian Consistent Attachment Location plugin (thanks @NSHenry !)
  • Added lowecase and uppercase options to parameterizations (thanks @redxtech !)
  • Updated Turndown to v7.1.3 (thanks @WeHat !)
  • Updated Readability to v0.5.0 (thanks @WeHat !)
  • Fixed some issues with code block parsing and formatting (thanks @WeHat !)
  • Fixed an issue with some sites missing a proper title (thanks @WeHat !)
  • Fixed an issue with bad base urls causing issues with links in certain circumstances (thanks @WeHat !)
  • Fixed an issue with readability removing everything in certain circumstances (thanks @WeHat !)
  • Send properly configured title to the Obsidian integration (thanks @nekton39 !)
  • Updates to the README (thanks @2gn and @eugenesvk !)

Previous version history is recorded in CHANGELOG.md

About

A Firefox and Google Chrome extension to clip websites and download them into a readable and LLM-friendly markdown file.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • JavaScript 92.6%
  • CSS 4.0%
  • HTML 3.0%
  • Swift 0.4%