I have to implement a web crawler that visits Linked Data on the Web. I have built some simple functionality for it, and I have three questions:
- Which seed URIs should I use? I am looking for websites that provide data in RDF format and follow Tim Berners-Lee's Linked Data principles.
- What is generally meant by a round-based approach for web crawlers? I read about general web crawlers and found that a round-based approach should be followed.
- I am currently able to parse only web pages that return RDF/XML data. Is that sufficient for crawling Linked Data?
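
To make the second question concrete, here is a minimal sketch of what I currently understand by "round-based": the crawler processes the whole frontier of one round before moving on to the URIs discovered in that round (essentially breadth-first, one level per round). This is only my assumption about the term, not a definitive definition. The names `crawl_in_rounds`, `fetch_links`, `TOY_GRAPH`, and `toy_fetch_links` are hypothetical, and the toy in-memory graph stands in for real HTTP requests and RDF parsing.

```python
from typing import Callable, Iterable, List, Set


def crawl_in_rounds(
    seeds: Iterable[str],
    fetch_links: Callable[[str], Iterable[str]],
    max_rounds: int,
) -> List[List[str]]:
    """Return the URIs visited in each round, starting from the seeds.

    fetch_links is a hypothetical callback: given a URI, it returns the
    URIs found in that document (in a real crawler, via HTTP + RDF parsing).
    """
    seen: Set[str] = set()
    frontier: List[str] = []
    for uri in seeds:
        if uri not in seen:
            seen.add(uri)
            frontier.append(uri)

    rounds: List[List[str]] = []
    for _ in range(max_rounds):
        if not frontier:
            break
        rounds.append(frontier)
        next_frontier: List[str] = []
        # Finish every URI of this round before starting the next round.
        for uri in frontier:
            for link in fetch_links(uri):
                if link not in seen:
                    seen.add(link)
                    next_frontier.append(link)
        frontier = next_frontier
    return rounds


# Toy in-memory "web" (assumption: replaces real dereferencing of URIs).
TOY_GRAPH = {
    "http://example.org/a": ["http://example.org/b", "http://example.org/c"],
    "http://example.org/b": ["http://example.org/c", "http://example.org/d"],
    "http://example.org/c": ["http://example.org/a"],
    "http://example.org/d": [],
}


def toy_fetch_links(uri: str) -> List[str]:
    return TOY_GRAPH.get(uri, [])


if __name__ == "__main__":
    for i, uris in enumerate(
        crawl_in_rounds(["http://example.org/a"], toy_fetch_links, max_rounds=3), 1
    ):
        print(f"round {i}: {uris}")
```

Is this roughly what is meant, or does "round-based" imply something more (for example, per-host scheduling between rounds)?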