Preact frontend that reviews pdfs on a domain, % of site urls that are pdf files, average size, whether sample pdf passes accessibility checks. This client side application uses Web Awesome web components and themes for UI, zustand for state management, Vite for build and local development.
Enter a domain and the sitemap will be found and reviewed, if not available a limited crawl will be run to find pdf files
The pdf discovery phase reviews the amount of pdf files compared to the rest of the content and gets the sizes of the first 50 pdf files to get an average size, then an accessibility audit is initiated on the first pdf file
The accessibility audit is run with the Verapdf tool. The code that uses this tool is open source at: https://github.com/ScanGov/verapdf-auditor