pdf2xml is a PDF-to-XML converter based on the Xpdf library (http://www.foolabs.com/xpdf/home.html). It converts the information contained in a PDF file into XML. You first need to install Xpdf and libxml2 (see the documentation).
Hervé Déjean
Xerox Research Centre Europe
http://www.xrce.xerox.com/About-XRCE/People/Herve-Dejean
Features
- pdf to xml conversion
- text extraction
- vector graphics instruction extraction
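Once a PDF has been converted, the resulting XML can be read with any XML parser. The following is a minimal sketch of text extraction from such a file in Python. The element and attribute names used here (`DOCUMENT`, `PAGE`, `TEXT`, `TOKEN`, `number`) and the sample content are assumptions made for illustration, not the documented pdf2xml schema; check them against a real output file before relying on them.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: element and attribute names are assumptions for
# illustration, not the documented pdf2xml output schema.
SAMPLE_XML = """<DOCUMENT>
  <PAGE number="1" width="612" height="792">
    <TEXT x="72" y="90">
      <TOKEN x="72" y="90">Form</TOKEN>
      <TOKEN x="100" y="90">1040</TOKEN>
    </TEXT>
    <TEXT x="72" y="110">
      <TOKEN x="72" y="110">Name</TOKEN>
    </TEXT>
  </PAGE>
</DOCUMENT>"""


def extract_lines(xml_text):
    """Return {page number: [line strings]} for the assumed layout."""
    root = ET.fromstring(xml_text)
    pages = {}
    for page in root.iter("PAGE"):
        lines = []
        for text in page.iter("TEXT"):
            # Join the tokens of one text line with single spaces.
            tokens = [tok.text or "" for tok in text.iter("TOKEN")]
            lines.append(" ".join(tokens))
        pages[int(page.get("number"))] = lines
    return pages


if __name__ == "__main__":
    for number, lines in extract_lines(SAMPLE_XML).items():
        print(f"page {number}:")
        for line in lines:
            print("  " + line)
```

The XML carries content and coordinates rather than a rendering, so a script like this recovers the text but not the visual layout of the original PDF.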
License
GNU General Public License version 2.0 (GPLv2)
User Reviews
- The link for the SVN code is not working. I want to integrate this functionality into my Java project; please provide a valid link.
- Thanks, very good project!
- Used on the IRS f1040.pdf to produce f1040.xml; however, when viewed in Firefox, the browser indicated that the file had no styling, so it looked nothing like the PDF as displayed by Adobe Reader.
- Very useful, a must-have program. Great job!
- Simple, no fuss. Works for all types.