Just some notes so I don't forget for now
- Crawl pages (locally/externally by registering url?)
- Deduce structure (form/of microformats
- make tree from it and index?
- store as nosql json blobs?
- Return json blog of queryable fields
- Endpoint to search same
- ???
- Profit