Is it worthwhile to block malicious crawlers via iptables?

Question

I periodically check my server logs and I notice a lot of crawlers search for the location of phpmyadmin, zencart, roundcube, administrator sections and other sensitive data. Then there are also crawlers under the name "Morfeus Fucking Scanner" or "Morfeus Strikes Again" searching for vulnerabilities in my PHP scripts and crawlers that perform strange (XSS?) GET requests such as:

GET /static/)self.html(selector?jQuery(
GET /static/]||!jQuery.support.htmlSerialize&&[1,
GET /static/);display=elem.css(
GET /static/.*.
GET /static/);jQuery.removeData(elem,

Until now I've always been storing these IPs manually to block them using iptables. But as these requests are only performed a maximum number of times from the same IP, I'm having my doubts if it does provide any advantage security related by blocking them.

I'd like to know if it does anyone any good to block these crawlers in the firewall, and if so if there's a (not too complex) way of doing this automatically. And if it's wasted effort, maybe because these requests come from from new IPs after a while, if anyone can elaborate on this and maybe provide suggestion for more efficient ways of denying/restricting malicious crawler access.

FYI: I'm also already blocking w00tw00t.at.ISC.SANS.DFind:) crawls using these instructions: http://spamcleaner.org/en/misc/w00tw00t.html

If you use apache have a look at mod_security – Eddy May 11 '10 at 18:39 — Eddy, May 11 '10 at 18:39

score 3 · Answer 1 · answered May 11 '10 at 17:49

3

We use Cisco hardware-based firewalls rather than server software-based ones and they watch out for patterns of activity and block them for quite a while (30-90 days iirc). I'm sure other firewalls can do this but don't have experience. Basically what I'm saying is that if your firewall can use rules to look for abuse then you'll see the benefit over simply blocking known culprits.

answered May 11 '10 at 17:49

Chopper3

101,299
9
108
239

Chopper, any links on Cisco models configurations that do this, is websense needed? – Kyle Brandt May 11 '10 at 17:51
Have a look at Cisco's Intrusion Detection System Services Module (IDSM-2) for Cat65xx (http://cisco.com/en/US/products/hw/modules/ps2706/ps5058/index.html) and their ASA 5500 range (http://cisco.com/en/US/products/ps6120/index.html). I'm 99% sure that it doesn't use websense - you basically show it an example of the kind of behavior you want it to stop from previously logged examples, make any rule-mods you require and then leave it to it - basically it just sits and applies your rules to its blocking policy - you can easily define the usage profile of crawlers and it does the rest. – Chopper3 May 11 '10 at 18:00
+1 ah thanks, I have been reading up on this lately. I would probably look into the asa because all my 6500 switches are being used as foot stools now and they are just the right height ;-) – Kyle Brandt May 11 '10 at 18:04
Unfortunately I'm looking for a software solution. Thanks for your info though. – aardbol May 11 '10 at 18:13

Kyle Brandt · Answer 2 · 2010-05-11T17:52:16.480

1

If it is worthwhile is debatable and I don't really know.

As far as your complaint about the fact that they come from different IPs and you can only react by blocking the ip... You can fix this with a reverse proxy like Apache in reverse proxy mode (with something like mod_proxy / mod_security) or HAProxy. Basically if you know the patterns ahead of time you can just drop those requests before they even get to the webserver.

Also, for a bit of vocabulary these firewalls are call Web Application Firewalls (WAFs). They operate on Layer 7 by examining the HTTP requests and responses.

edited May 11 '10 at 17:52

answered May 11 '10 at 17:45

Kyle Brandt

83,619
74
305
448

1

This is cool too: http://www.owasp.org/index.php/Category:OWASP_Best_Practices:_Use_of_Web_Application_Firewalls – Warner May 11 '10 at 17:51
Warner: Actually was playing with HAProxy and its logs to get fail2ban to work with it (have the basic regex down). I question how much that could scale though. – Kyle Brandt May 11 '10 at 17:53
Oh, and thanks for that link that should could be busy for tonight's reading :-) – Kyle Brandt May 11 '10 at 17:54

score 0 · Answer 3 · answered May 11 '10 at 21:49

you could always take some of the strings/GETs you're finding and since you already have the string module for iptables, log/drop those packets, and potentially automate adding them to a firewall with a script.

generally speaking, i would say you're good to block those IPs, because they may have been compromised in some way or another, and if they've been compromised, and you're catching one attack, you might be missing another.

Is it worthwhile to block malicious crawlers via iptables?

3 Answers3