The following URLs are crawlers that have been banned by the IT Skeptic:
18.104.22.168 crawler.bloglines.com - controvertial banning, but they leave error messages everywhere and eat huge resource
22.214.171.124 A hungry crawler from irldotcsdottamudotedu, doing "research". Piss off.
126.96.36.199 out of Turkey
188.8.131.52, 184.108.40.206 out of Japan.
220.127.116.11 Japanese. Big resource eaters. Look like spammers
18.104.22.168 Chinese Sogou spider corp(dot)sohu(dot)com(slash)20051130(slash)n240842344(dot)shtml .
22.214.171.124 Dunno who it is, comes out of China, but very hungry spider
126.96.36.199 China Railway corporation! dear me. Ate 50% more than Google. And right after the Chinese Govt featured in a spoof on the IT Skeptic. Chinese spooks I reckon. So they can combine sex and travel, and ...
188.8.131.52 out of Vietnam
184.108.40.206 Dunno who dwhl.de are but they provide an anonymous ftp server too
87.99.76.% from Latvia (220.127.116.11 - 18.104.22.168 )
89.34.173.% from Romania
These crawlers are permitted to access this site:
22.214.171.124 News gator. High hits but low resource usage - nice people.
126.96.36.199, 188.8.131.52 - 184.108.40.206 Despite being by far the biggest remaining muncher of resource, who can say no to Google? But man Googlebot 220.127.116.11 is hungry!!
18.104.22.168, 22.214.171.124 "Burning Door", a.k.a feedburner. Better let them in, they serve my RSS feeds!
126.96.36.199 Nice light crawler
188.8.131.52 A Mickeysoft bot, average hunger but hits lightly
This list does not include known spammers who have been blocked. "Crawlers" are grouped (by me) by their high hit rates and/or high resource consumption: their intent may or may not be legitimate.