# Unknown guestbook spamming or harvesting tool from diff. IPs User-agent: 8484 Boston Project v 1.0 Disallow: / # Atomic Email Hunter email extracing and harvesting User-agent: Atomic_Email_Hunter/4.0 Disallow: / # atSpider (ceased) email harvester / spambot User-agent: atSpider/1.0 Disallow: / # Auto Email Pro Email harvester User-agent: autoemailspider Disallow: / # Email harvester User-agent: CherryPickerElite/1.0 Disallow: / # Email harvester User-agent: CherryPickerSE/1.0 Disallow: / # Probably E-Mail harvesting robot - same as LMQueueBot User-agent: ContactBot/0.2 Disallow: / # ContentSmartz e-mail harvesting tools User-agent: ContentSmartz Disallow: / # Email harvester User-agent: Crescent Internet ToolPak HTTP OLE Control v.1.0 Disallow: / # Some site scanning tool via diff. IPs i.e.: - wanweb.net (208.6.163.xxx) - cox.net (68.4.xxx.xxx) User-agent: DBrowse 1.4b Disallow: / # Some site scanning tool via diff. IPs i.e.: - pacbell.net (67.112.xxx.xxx) User-agent: DBrowse 1.4d Disallow: / # Some site scanning tool from 217.34.59.xxx (btopenworld.com) User-agent: Demo Bot DOT 16b Disallow: / # Some site scanning tool from 68.154.96.xx (bellsouth.net) User-agent: Demo Bot Z 16b Disallow: / # Some site scanning tool via diff. IPs i.e.: - cox.net (68.5.xxx.xxx) - pacbell.net (64.16x.xxx.xxx) User-agent: DSurf15a 01 Disallow: / # Some site scanning tool via diff. IPs i.e.: - cox.net (68.4.xxx.xxx) User-agent: DSurf15a 71 Disallow: / # Some site scanning tool via diff. IPs i.e.: - verizon.net (4.47.xxx.xxx) User-agent: DSurf15a 81 Disallow: / # Some site scanning tool via diff. IPs i.e.: - eastlink.ca (24.222.xxx.xxx) - cogeco.net (216.221.8x.xxx) User-agent: DSurf15a VA Disallow: / # Some site scanning tool via diff. IPs i.e.: - swbell.net (65.66.xxx.xxx) User-agent: EBrowse 1.4b Disallow: / # Some site scanning tool via diff. IPs i.e.: - cox.net (68.4.xxx.xxx) User-agent: Educate Search VxB Disallow: / # Email harvester User-agent: EmailCollector/1.0 Disallow: / # Sonic E-mail collector User-agent: EmailSiphon Disallow: / # EmailSpider E-mail harvesting software User-agent: EmailSpider Disallow: / # Trellian EMailWolf E-mail collector User-agent: EmailWolf 1.00 Disallow: / # Some site scanning tool via diff. IPs User-agent: ESurf15a 15 Disallow: / # Extractor Pro e-mail collector User-agent: ExtractorPro Disallow: / # Some spam bot User-agent: Franklin Locator 1.8 Disallow: / # Some site scanning tool via diff. IPs User-agent: FSurf15a 01 Disallow: / # Some site scanning tool from diff. IPs i.e.: - 66.28.240.xx (cogentco.com) - 68.5.174.xx (cox.net) User-agent: Full Web Bot 0416B Disallow: / # Some site scanning tool i.e. from - 68.154.96.xx (bellsouth.net) User-agent: Full Web Bot 0516B Disallow: / # Some site scanning tool from 66.255.6.xxx (uslec.com) User-agent: Full Web Bot 2816B Disallow: / # Guestbook spamming tool User-agent: Guestbook Auto Submitter Disallow: / # Spam bot from diff. IPs User-agent: Industry Program 1.0.x Disallow: / # Converas RetrievalWare Internet Spider (63.241.61.x) User-agent: infoConveraCrawler/0.8 ( http://www.authoritativeweb.com/crawl) Disallow: / # Unknown spambot / harvester from diff. IPs User-agent: ISC Systems iRc Search 2.1 Disallow: / # Some spam bot from 66.139.78.xx(x) User-agent: IUPUI Research Bot v 1.9a Disallow: / # Maybe logfile spamming for Lets crawl! search (Germany) User-agent: LetsCrawl.com/1.0 +http://letscrawl.com/ Disallow: / # Some spam bot User-agent: Lincoln State Web Browser Disallow: / # ThePlanet/jaja-jak-globusy.com Google Adsense refferer spam bot from 70.85.116.* / 70.84.128.xxx / 70.85.193.xxx User-agent: LWP::Simple/5.803 Disallow: / # Some spam bot User-agent: Mac Finder 1.0.xx Disallow: / # Microsoft Foundation Class Library - i.e. used for e-mail harvesting from 68.154.96.xx (bellsouth.net) User-agent: MFC Foundation Class Library 4.0 Disallow: / # user agent looks for form-mail components (spam-bot) User-agent: Microsoft URL Control - 6.00.8xxx Disallow: / # Some spam bot User-agent: Missauga Locate 1.0.0 Disallow: / # Some spam bot User-agent: Missigua Locator 1.9 Disallow: / # Some spam bot User-agent: Missouri College Browse Disallow: / # Some spam bot from Jasmine Internet - Bangkok (203.147.0.xx) User-agent: Mizzu Labs 2.2 Disallow: / # Unknown bad bot - maybe guestbook spamming or email harvesting User-agent: Mo College 1.9 Disallow: / # Unknown bad bot from diff. Taiwanese IPs User-agent: MVAClient Disallow: / # Borland Delphi .OCX component used by WebCollector email harverster User-agent: Mozilla/2.0 (compatible; NEWT ActiveX; Win32) Disallow: / # Faked user agent for diff. purposes i.e.: - some download manager - E-mail harvesting User-agent: Mozilla/3.0 (compatible) Disallow: / # Internet Direct Library for Borland (often used as e-mail address collector and mass mailing tool) User-agent: Mozilla/3.0 (compatible; Indy Library) Disallow: / # Mozilla/3.0 (compatible; Indy Library) User-agent: Mozilla/3.0 (compatible; scan4mail (advanced version) http://www.peterspages.net/?scan4mail) Disallow: / # Advanced Email Extractor e-mail collector (spam bot) User-agent: Mozilla/4.0 (compatible; Advanced Email Extractor v2.xx) Disallow: / # Iplexx Austria (webhosting company) logfile spamming bot User-agent: Mozilla/4.0 (compatible; Iplexx Spider/1.0 http://www.iplexx.at) Disallow: / # Beijing Express Email Address Extractor via DHCP Data Transport Services (DTS) User-agent: Mozilla/4.0 (compatible; MSIE 5.0; Windows NT; DigExt; DTS Agent Disallow: / # Maybe: - MS Internet Security & Acceleration Server (ISA) cache refreshing request (see link) or - IE 5.5 Win2000 probably with some (website) API request component (see 2nd link) - suspected as email-harvester / site scanning tool (see http://www.byte.com/documents/s=493/byt20010208s0001/index.htm User-agent: Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 5.0) Fetch API Request Disallow: / # Unknown robot from 66.230.140.xx (argon.oxeo.com) maybe an e-mail collector User-agent: Mozilla/4.0 efp@gmx.net Disallow: / # Some spambot from Romania (82.208.139.1xx & 86.123.65.xx) - Maybe email harvesting User-agent: Mozilla/5.0 (Version: xxxx Type:xx) Disallow: / # Badbot searching for Wordpress wp-login.php User-agent: NameOfAgent (CMS Spider) Disallow: / # Unknown spambot / harvester from diff. IPs User-agent: NASA Search 1.0 Disallow: / # Nsauditor Network Security Auditor User-agent: Nsauditor/1.x Disallow: / # Some site scanning tool via diff. IPs- i.e.: - cox.net (68.4.xxx.xxx) User-agent: PBrowse 1.4b Disallow: / # Some site scanning tool via diff. IPs User-agent: PEval 1.4b Disallow: / # ThePlanet/jaja-jak-globusy.com Google Adsense refferer spam bot from 70.85.116.* / 70.84.128.xxx / 70.85.193.xxx User-agent: Poirot Disallow: / # Unknown spam bot / harvester (63.223.10.***) User-agent: Port Huron Labs Disallow: / # Some site scanning tool from diff. IPs- i.e.: - 67.99.33.x (lightningcon.broadwing.net) User-agent: Production Bot 0116B Disallow: / # Some site scanning tool from diff. IPs- i.e.: - 216.232.64.xx (telus.net) User-agent: Production Bot 2016B Disallow: / # Some site scanning tool from diff. IPs- i.e.: - 141.154.181.xxx (east.verizon.net) User-agent: Production Bot DOT 3016B Disallow: / # Some spam bot User-agent: Program Shareware 1.0.2 Disallow: / # Some site scanning tool via diff. IPs- i.e.: QWest Net User-agent: PSurf15a 11 Disallow: / # Some site scanning tool via diff. IPs- i.e.: Optonline net (24.191.xxx.xxx) User-agent: PSurf15a 51 Disallow: / # Some site scanning tool via diff. IPs- i.e.: - choiceone.net (216.153.xxx.xxx) - attbi.com (12.250.xxx.xxx) - optonline.net (24.191.xxx.xxx) User-agent: PSurf15a VA Disallow: / # Unknown website grabbing / ripping for unknown purposes from 208.66.195.x - Digitalinfinity.org Russia User-agent: psycheclone Disallow: / # Some site scanning tool via diff. IPs- i.e.: - dslx.net (208.35.1x.xxx) - Home.com User-agent: RSurf15a 41 Disallow: / # Some site scanning tool via diff. IPs- i.e.: - dslx.net (208.35.1x.xxx) - Home.com User-agent: RSurf15a 51 Disallow: / # Some site scanning tool via diff. IPs- i.e.: - dslx.net (208.35.1x.xxx) - Home.com User-agent: RSurf15a 81 Disallow: / # Unknown robot / website grabber from Chinatelecom (219.142.78.xxx) User-agent: searchbot admin@google.com Disallow: / # Unknown robot from Shablast.com - Website has no content - Ignores robots.txt User-agent: ShablastBot 1.0 Disallow: / # Unknown bot from bb2.net (66.234.139.xxx) also as Snapbot/1.0 User-agent: snap.com beta crawler v0 Disallow: / # Unknown bot from bb2.net (66.234.139.xxx) - also as snap.com User-agent: Snapbot/1.0 Disallow: / # Unknown bot from Psinet / Cogentco - not from Snap.com User-agent: Snapbot/1.0 (Snap Shots, +http://www.snap.com) Disallow: / # Unknown UA from Chinanet (220.181.26.1xx) faking Sogou search robot User-agent: sogou develop spider Disallow: / # Unknown UA from Chinanet (220.181.18.xx) faking Sogou search robot User-agent: Sogou Orion spider/3.0(+http://www.sogou.com/docs/help/webmasters.htm#07) Disallow: / # Unknown UA from Chinanet (220.181.26.1xx) faking Sogou search robot User-agent: sogou spider Disallow: / # Unknown UA from Chinanet (220.181.26.1xx) faking Sogou search robot User-agent: Sogou web spider/3.0(+http://www.sogou.com/docs/help/webmasters.htm#07) Disallow: / # Unknown UA from Chinanet (220.181.26.1xx) faking Sogou search robot User-agent: sohu agent Disallow: / # Some site scanning tool via diff. IPs i.e.: - choiceone.net (216.153.xxx.xxx) - epix.net (216.108.198.xx) User-agent: SSurf15a 11 Disallow: / # some bad user agent User-agent: TSurf15a 11 Disallow: / # Unknown mail harvester/spambot from 80.58.13.xxx (proxycache.rima-tde.net) User-agent: Under the Rainbow 2.2 Disallow: / # Malformed UA header from some guestbook/forum spammer User-agent: User-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1) Disallow: / # Unknown bad behaving bot via Road Runner User-agent: VadixBot Disallow: / # Email harvester User-agent: WebBandit/2.1 Disallow: / # Email harvester User-agent: WebBandit/3.50 Disallow: / # Email harvester User-agent: Webbandit/4.00.0 Disallow: / # Web Vulnerability Crawler User-agent: WebVulnCrawl.unknown/1.0 libwww-perl/5.803 Disallow: / # Unknown spam bot / harvester (62.163.**.** / 62.194.**.*) User-agent: Wells Search II Disallow: / # Some spam bot User-agent: WEP Search 00 Disallow: / # Spam bot / harvester User-agent: User-agent: Wget Disallow: / User-agent: * Disallow: /2008_ Disallow: /2009_ Disallow: /app_code Disallow: /archives Disallow: /bdir Disallow: /bod Allow: /content Allow: /css Allow: /documents Allow: /forms Allow: /images Disallow: /logs Disallow: /mySQLbackup Allow: /photogallery Disallow: /phpMYAdmin Disallow: /ssl Disallow: /test Disallow: /tmp Disallow: /uploads Allow: /sitemap.xml Sitemap: http://pittsburghskiclub.org/sitemap.html # created 2009-10-12 (original issue) # updated 2009-10-21 (corrected Disallow for /2008_ and /2009_, corrected Allow for /images, removed duplicate entry for User-agent Wells Search II)