dhtcrawler is a DHT crawler written in erlang. It can join a DHT network and crawl many P2P torrents. The program save all torrent info into database and provide an http interface to search a torrent by a keyword.
-
Download mongodb and start it with text search, i.e:
mongod --dbpath db --setParameter textSearchEnabled=true -
Download dhtcrawler source code
-
Use
rebarto download and install all dependent librariesrebar get-deps -
compile
rebar compile -
starting the erlang shell with the ebin path added
erl -pa ebin -
start dhtcrawler
crawler_app:start() -
start the http front-end
crawler_http:start(). -
Open a web browser and point to
localhost:8000/index.html
see priv/dhtcrawler.config.
NOTE, when you change node_count value in dhtcrawler.config, you'd better delete all files saved in dhtstate directory.