I'm not quite sure whether this is the suitable forum to post my question. I'm analyzing web server logs both in Apache and IIS log formats. I want to find the evidences for automatic browsing(Ex. Web robots,spiders,bots etc.) I used python robot-detection 0.2.8 for detecting robots in my log files. Anyway there may be other robots(automatic programs) which have traversed through the web site but robot-detection can not identify.
- So are there any specific clues that can be found in log files(that human users do not perform but software perform actions etc)?
- Do they follow a specific navigation pattern?
- I saw some requests for favicon.ico? Does this implicate that it is a automatic browsing?.
I found this article with some valuable points.