The following URLs are crawlers that have been banned by the IT Skeptic:
184.108.40.206 crawler.bloglines.com - controvertial banning, but they leave error messages everywhere and eat huge resource
220.127.116.11 A hungry crawler from irldotcsdottamudotedu, doing "research". Piss off.
18.104.22.168 out of Turkey
22.214.171.124, 126.96.36.199 out of Japan.
188.8.131.52 Japanese. Big resource eaters. Look like spammers
184.108.40.206 Chinese Sogou spider corp(dot)sohu(dot)com(slash)20051130(slash)n240842344(dot)shtml .
220.127.116.11 Dunno who it is, comes out of China, but very hungry spider
18.104.22.168 China Railway corporation! dear me. Ate 50% more than Google. And right after the Chinese Govt featured in a spoof on the IT Skeptic. Chinese spooks I reckon. So they can combine sex and travel, and ...
22.214.171.124 out of Vietnam
126.96.36.199 Dunno who dwhl.de are but they provide an anonymous ftp server too
87.99.76.% from Latvia (188.8.131.52 - 184.108.40.206 )
89.34.173.% from Romania
These crawlers are permitted to access this site:
220.127.116.11 News gator. High hits but low resource usage - nice people.
18.104.22.168, 22.214.171.124 - 126.96.36.199 Despite being by far the biggest remaining muncher of resource, who can say no to Google? But man Googlebot 188.8.131.52 is hungry!!
184.108.40.206, 220.127.116.11 "Burning Door", a.k.a feedburner. Better let them in, they serve my RSS feeds!
18.104.22.168 Nice light crawler
22.214.171.124 A Mickeysoft bot, average hunger but hits lightly
This list does not include known spammers who have been blocked. "Crawlers" are grouped (by me) by their high hit rates and/or high resource consumption: their intent may or may not be legitimate.