The following IP addresses belong to crawlers that have been banned by the IT Skeptic:
188.8.131.52 crawler.bloglines.com - controversial banning, but they leave error messages everywhere and eat huge resources
184.108.40.206 A hungry crawler from irldotcsdottamudotedu, doing "research". Piss off.
220.127.116.11 out of Turkey
18.104.22.168, 22.214.171.124 out of Japan.
126.96.36.199 Japanese. Big resource eaters. Look like spammers
188.8.131.52 Chinese Sogou spider: corp(dot)sohu(dot)com(slash)20051130(slash)n240842344(dot)shtml
184.108.40.206 Dunno who it is; comes out of China, but a very hungry spider
220.127.116.11 China Railway corporation! Dear me. Ate 50% more than Google. And right after the Chinese Govt featured in a spoof on the IT Skeptic. Chinese spooks, I reckon. So they can combine sex and travel, and ...
18.104.22.168 out of Vietnam
22.214.171.124 Dunno who dwhl.de are, but they provide an anonymous FTP server too
87.99.76.% from Latvia (126.96.36.199 - 188.8.131.52)
89.34.173.% from Romania
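For anyone wanting to check an address against wildcard entries like the two above, here is a minimal sketch. It assumes `%` stands for "any value in that octet"; the function names `matches` and `is_banned` and the `BANNED` list are made up for illustration and are not what this site actually runs.

```python
# Hypothetical helper: assumes "%" in a pattern matches any single octet.
BANNED = ["87.99.76.%", "89.34.173.%"]  # the two wildcard entries from the list


def matches(pattern: str, ip: str) -> bool:
    """Return True if the dotted-quad ip fits the pattern, octet by octet."""
    pattern_parts = pattern.split(".")
    ip_parts = ip.split(".")
    if len(pattern_parts) != 4 or len(ip_parts) != 4:
        return False
    return all(p == "%" or p == octet for p, octet in zip(pattern_parts, ip_parts))


def is_banned(ip: str, patterns=BANNED) -> bool:
    """Return True if ip matches any pattern in the ban list."""
    return any(matches(p, ip) for p in patterns)
```

With this sketch, `is_banned("87.99.76.10")` is true and `is_banned("87.99.77.10")` is false.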
These crawlers are permitted to access this site:
184.108.40.206 NewsGator. High hits but low resource usage - nice people.
220.127.116.11, 18.104.22.168 - 22.214.171.124 They're by far the biggest remaining muncher of resource, but who can say no to Google? And man, Googlebot 126.96.36.199 is hungry!!
188.8.131.52, 184.108.40.206 "Burning Door", a.k.a. FeedBurner. Better let them in, they serve my RSS feeds!
220.127.116.11 Nice light crawler
18.104.22.168 A Mickeysoft bot, average hunger but hits lightly
This list does not include known spammers who have been blocked. "Crawlers" are identified (by me) by their high hit rates and/or high resource consumption; their intent may or may not be legitimate.
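A rough sketch of how hit rate and resource consumption per address could be tallied from a web server access log. It assumes Common Log Format lines, where the client address is the first field and the response size in bytes is the tenth. The function name `tally` is hypothetical, and this is not necessarily how the figures on this page were gathered.

```python
from collections import defaultdict


def tally(lines):
    """Return {ip: [hits, bytes]} from Common Log Format lines (assumed format)."""
    stats = defaultdict(lambda: [0, 0])
    for line in lines:
        fields = line.split()
        if len(fields) < 10:
            continue  # skip lines that do not look like log entries
        ip, size = fields[0], fields[9]
        stats[ip][0] += 1  # one more hit for this address
        stats[ip][1] += int(size) if size.isdigit() else 0  # "-" means no body
    return dict(stats)
```

Sorting the result by hits or by bytes would show which visitors are the hungriest, whatever their intent.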