logo
Apache Lounge
Webmasters

 

About Forum Index Downloads Search Register Log in RSS X


Keep Server Online

If you find the Apache Lounge, the downloads and overall help useful, please express your satisfaction with a donation.

or

Bitcoin

A donation makes a contribution towards the costs, the time and effort that's going in this site and building.

Thank You! Steffen

Your donations will help to keep this site alive and well, and continuing building binaries. Apache Lounge is not sponsored.
Post new topic   Forum Index -> Third-party Modules View previous topic :: View next topic
Reply to topic   Topic: How to mitigate teleport pro automation webcrawler attack?
Author
maskego



Joined: 16 Apr 2010
Posts: 238

PostPosted: Mon 26 Mar '12 8:28    Post subject: How to mitigate teleport pro automation webcrawler attack? Reply with quote

Is there anyone can provide useful way to mitigate automation crawler attack to apace2.2.x site?

I use modsecurity can't stop it. Rolling Eyes
Back to top
tdonovan
Moderator


Joined: 17 Dec 2005
Posts: 611
Location: Milford, MA, USA

PostPosted: Mon 26 Mar '12 16:45    Post subject: Reply with quote

Teleport Pro isn't really an attack - it is a website browser which makes a lot of connections and requests as it scans your site.

If you want to block it in Apache 2.2, you could deny access based on its User Agent string, like this:
Code:
BrowserMatchNoCase Teleport TELEPORT=TRUE
Deny from env=TELEPORT

In Apache 2.4 it is easier using the new expressions which are available:
Code:
Require expr %{HTTP_USER_AGENT} !~ /Teleport/i

If this doesn't work for you - and it is just one person accessing your site with Teleport Pro - you could look in your access.log for their IP address (xxx.xxx.xxx.xxx) and block it:
Code:
Deny From xxx.xxx.xxx.xxx

For Apache 2.4:
Code:
Require not ip xxx.xxx.xxx.xxx

-tom-
Back to top
maskego



Joined: 16 Apr 2010
Posts: 238

PostPosted: Tue 27 Mar '12 2:24    Post subject: Reply with quote

tom,

Thank your reply.

Teleport is just a browser,but I think it is a malware scanner.It will increase your site's loading by illegal usage.And,it scans all your site.

I try your way to deny its user-agent at .htaccess.But,it doesn't take effects.Is there any good idea to block it via modsecurity?(the newest teleport pro1.65)
Back to top
Steffen
Moderator


Joined: 15 Oct 2005
Posts: 3092
Location: Hilversum, NL, EU

PostPosted: Tue 27 Mar '12 10:48    Post subject: Reply with quote

The following rule set checks the user Agent:

modsecurity_crs_35_bad_robots.conf , it already checks "teleport pro".


Steffen
Back to top
maskego



Joined: 16 Apr 2010
Posts: 238

PostPosted: Tue 27 Mar '12 12:16    Post subject: Reply with quote

The strange thing is that I make a test to use teleport pro to mirror my live site,but the teleport pro take effects.(teleport pro download link:http://www.tenmax.com/teleport/pro/download.htm)

I am sure to apply this rule of mod_security CRS.

Does it mean that mod_security can't stop teleport pro automation web crawler? Rolling Eyes Or what setting should I revise?
Back to top
Steffen
Moderator


Joined: 15 Oct 2005
Posts: 3092
Location: Hilversum, NL, EU

PostPosted: Tue 27 Mar '12 12:52    Post subject: Reply with quote

I stops with the above rules.
I tried it with curl:

curl 127.0.0.1/test -A "bla bla Teleport pro bla bla"

ModSecurity: Access denied with code 403 (phase 2). Pattern match "(?iSad?:c(?:o(?:n(?:t(?:entsmartz|actbot/)|cealed defense|veracrawler)|mpatible(?: ;(?: msie|\\\\.)|-)|py(?:rightcheck|guard)|re-project/1.0)|h(?:ina(?: local browse 2\\\\.|claw)|e(?:rrypicker|esebot))|rescent internet toolpak)|w(?:e(?:b(?: (?:downloader|by ..." at REQUEST_HEADERS:User-Agent. [file "D:/servers/apache/conf/httpd.conf"] [line "523"] [id "990012"] [rev "2.2.4"] [msg "Rogue web site crawler"] [data "teleport pro"] [severity "WARNING"] [tag "AUTOMATION/MALICIOUS"] [tag "WASCTC/WASC-21"] [tag "OWASP_TOP_10/A7"] [tag "PCI/6.5.10"] [hostname "127.0.0.1"] [uri "/test"] [unique_id "T3Gah8CoAQQAABcEN08AAABz"]

Steffen

ps.
You quote a lot in your posts, see the forum rules. Unnecessary use of quotes (for a topic history, scroll up). I removed a lot in your posts.
Back to top
maskego



Joined: 16 Apr 2010
Posts: 238

PostPosted: Tue 27 Mar '12 12:58    Post subject: Reply with quote

I doubt the newest teleport pro doesn't use teleport pro user-agent.Actually,I use teleport pro 1.65 to mirror a site,there is no error shows or be stopped.

I don't see those detection messages at logs.

Can you tell me where should I revise?

And,what steps can I do the same test like you did under windows os? Embarassed
Back to top
Steffen
Moderator


Joined: 15 Oct 2005
Posts: 3092
Location: Hilversum, NL, EU

PostPosted: Tue 27 Mar '12 13:12    Post subject: Reply with quote

Look in your logs to see what user agent is used.

A curl.exe you can find in ApacheStats.zip at the additional download page.

Btw.
A lot of scanners/crawlers are around and most use a "normal" user agent. To stop is mostly hard, and have to live with it.

Steffen
Back to top
maskego



Joined: 16 Apr 2010
Posts: 238

PostPosted: Tue 27 Mar '12 13:25    Post subject: Reply with quote

The bad teleport pro uses the "User-Agent: Mozilla/4.0 (compatible; MSIE 7.0; Windows NT)" and "User-Agent: Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.1; Trident/5.0)" to crawler.I find it at logs.

How to block it?
Back to top
Steffen
Moderator


Joined: 15 Oct 2005
Posts: 3092
Location: Hilversum, NL, EU

PostPosted: Tue 27 Mar '12 13:34    Post subject: Reply with quote

That are valid/normal user agents. When you block based on these user agents then you block also users.

Only way to block a IP you know is using teleport, but that make not so much sense.

So, i think you have to live with it. Here crawlers sucking the site daily a few Gigs, most webmasters dealing with it.


Steffen
Back to top
glsmith
Moderator


Joined: 16 Oct 2007
Posts: 2268
Location: Sun Diego, USA

PostPosted: Tue 27 Mar '12 22:02    Post subject: Reply with quote

maskego,

You say "bad Telepoert pro" like there is just the one. Does this bad teleport pro have just one IP? If so, block it's IP address instead.
Back to top
maskego



Joined: 16 Apr 2010
Posts: 238

PostPosted: Wed 28 Mar '12 2:41    Post subject: Reply with quote

gl:

There are many web crawlers not just one.
I call "teleport pro" as bad one,it means the teleport pro making abuse net freely in spite of webmasters.

steffen:

As you says,web crawlers suck the sites always.It not perfectly to ban ips.I feel it's a blind way to anti malicious web crawlers or scanners or robots.

Is it possible to find the web crawlers specific character to stop it before sucking the site?(When it search the robots.txt,how can we use modsecurity to ban it's ip automatically?Where can I find this rule?)

Can someone report the need to the modsecurity? Maybe it makes disgusting web crawlers to go away. Laughing
Back to top


Reply to topic   Topic: How to mitigate teleport pro automation webcrawler attack? View previous topic :: View next topic
Post new topic   Forum Index -> Third-party Modules