What is the best open-source web crawler written in Java?
2 Answers
10
Try crawler4j. You just need to implement a simple interface which controls which URLs to visit and what to do with each crawled page.
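
To make the "which URLs to visit / what to do with each page" split concrete, here is a minimal sketch of that callback pattern. It does not use crawler4j: `PageHandler`, `CrawlSketch`, and the in-memory `web` map are hypothetical names invented for illustration, and the "web" is just a map from URL to outgoing links, so nothing is fetched over the network. As far as I recall, crawler4j exposes the same two hooks as `shouldVisit` and `visit` on its `WebCrawler` class, but check the project's README for the exact signatures.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical callback interface for illustration; NOT crawler4j's actual API.
interface PageHandler {
    // Decide whether a newly discovered URL should be crawled.
    boolean shouldVisit(String url);

    // Do something with a page that has been crawled.
    void visit(String url, List<String> outgoingLinks);
}

final class CrawlSketch {
    // Breadth-first crawl over an in-memory link graph that stands in for the web.
    // The seed is always visited; shouldVisit only filters links found later.
    static List<String> crawl(String seed, Map<String, List<String>> web, PageHandler handler) {
        List<String> visited = new ArrayList<>();
        Set<String> seen = new HashSet<>();
        Deque<String> frontier = new ArrayDeque<>();
        seen.add(seed);
        frontier.add(seed);
        while (!frontier.isEmpty()) {
            String url = frontier.poll();
            List<String> links = web.getOrDefault(url, List.of());
            handler.visit(url, links);
            visited.add(url);
            for (String link : links) {
                // Each link is considered once; rejected links are never queued.
                if (seen.add(link) && handler.shouldVisit(link)) {
                    frontier.add(link);
                }
            }
        }
        return visited;
    }

    public static void main(String[] args) {
        Map<String, List<String>> web = Map.of(
            "https://example.com/", List.of("https://example.com/a", "https://other.test/"),
            "https://example.com/a", List.of("https://example.com/"));
        PageHandler handler = new PageHandler() {
            @Override public boolean shouldVisit(String url) {
                // Stay on one site.
                return url.startsWith("https://example.com/");
            }
            @Override public void visit(String url, List<String> links) {
                System.out.println(url + " -> " + links.size() + " links");
            }
        };
        System.out.println(crawl("https://example.com/", web, handler));
    }
}
```

With the sample data in `main`, the crawl visits the two `example.com` pages and never queues `https://other.test/`, because `shouldVisit` rejects it.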

Scott Wardlaw

Andy
I have problems crawling HTTPS websites with this crawler: it reports "site failed to respond", although the same sites open fine in a browser. – ed22 Sep 29 '17 at 08:29