#218559 - 10/30/08 11:05 AM Spiders aggressively spidering cache
doug Offline
member

Registered: 01/24/07
Posts: 124
Every day my cache gets aggressively spidered, usually by a return visit looking for files that no longer exist, and Apache crashes as a result.

Robots.txt does not stop them, so every time Apache crashes I check the logs and block the offending IP. A few hours later a new IP spiders the cache (all overseas IP addresses). I do this several times a day, which is getting annoying.

Is there a way to stop the spidering of the cache files without affecting the operation of the forum? A permissions setting perhaps?
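
The manual routine described above (check the logs, find the offending IP) could be sketched roughly as follows. This is only an illustration, not something from the forum software: it assumes Apache's common/combined log format, and the `/cache/` path prefix and the function names are hypothetical placeholders to adapt to the actual install.

```python
import re
from collections import Counter

# Assumption: each log line starts with the client IP and contains a quoted
# request such as "GET /path HTTP/1.1" followed by a 3-digit status code
# (Apache common/combined log format).
LOG_LINE = re.compile(r'^(\S+) \S+ \S+ \[[^\]]*\] "(?:\S+) (\S+)[^"]*" (\d{3})')


def count_cache_hits(lines, cache_prefix="/cache/", status=None):
    """Count requests per client IP whose path starts with cache_prefix.

    cache_prefix is a hypothetical path; status optionally restricts the
    count to one HTTP status code given as a string, e.g. "404".
    """
    hits = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if not match:
            continue  # skip lines that are not in the assumed format
        ip, path, code = match.groups()
        if path.startswith(cache_prefix) and (status is None or code == status):
            hits[ip] += 1
    return hits


def heavy_hitters(hits, threshold):
    """Return the sorted IPs with at least `threshold` cache requests."""
    return sorted(ip for ip, n in hits.items() if n >= threshold)
```

Feeding it the lines of an access log (for example `count_cache_hits(open(path), status="404")`, with `path` pointing at the log) would list the clients requesting cache files that no longer exist. It only identifies candidates; blocking them is still a separate step.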

#218562 - 10/30/08 12:20 PM Re: Spiders aggressively spidering cache [Re: doug]
Ruben Rocha Offline
addict

Registered: 12/20/03
Posts: 454
Loc: Lutz,FL
I would first check CP > Primary Settings > General tab > Advanced Options.
See if "Enable Spider-friendly URLs?" is turned on. If it is, I would start by turning that off.
_________________________
I am not an expert.
I just use the stuff!!
At the very least post your forum URL in your profile.
If you receive help here at least post back to let us know what the outcome is.
In case you did not know we are just users!

#218567 - 10/30/08 01:33 PM Re: Spiders aggressively spidering cache [Re: Ruben Rocha]
doug Offline
member

Registered: 01/24/07
Posts: 124
I want to keep the spider-friendly URLs. I want the spiders to access the posts, but not the cached copy of the posts.

#218582 - 10/30/08 04:47 PM Re: Spiders aggressively spidering cache [Re: doug]
Gizmo Moderator Offline


Registered: 06/04/06
Posts: 12089
Loc: Portland, OR; USA
If the spider is ignoring robots.txt, it's an aggressive/abusive spider and you should probably ban it... Any idea which one it is?
_________________________
UGN Security, Elite Web Gamers & VNC Web Design Owner
Longtime UBB Supporter, UBB7 Beta Tester & Resident Post-A-Holic
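
To illustrate the point about robots.txt: the file is only advisory, and a well-behaved crawler consults it roughly as sketched below using Python's standard `urllib.robotparser`. The `/cache/` rule, the example URLs, and the `allowed` helper are assumptions for illustration, not the forum's actual configuration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt asking all crawlers to stay out of /cache/.
ROBOTS = """\
User-agent: *
Disallow: /cache/
"""


def allowed(robots_txt, user_agent, url):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# A polite crawler would skip the first URL and fetch the second.
cache_ok = allowed(ROBOTS, "AnyBot", "http://example.com/cache/page.html")
forum_ok = allowed(ROBOTS, "AnyBot", "http://example.com/ubbthreads.php")
```

Nothing forces a client to run this check, which is why a spider that ignores robots.txt has to be stopped on the server side instead.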

#218585 - 10/30/08 05:25 PM Re: Spiders aggressively spidering cache [Re: Gizmo]
doug Offline
member

Registered: 01/24/07
Posts: 124
The user agent just lists a browser and operating system, and it is not the same for each incident. I have banned 4 IPs already today (and yesterday, and the day before, etc.); they just keep coming.
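
Since the user agent gives nothing to match on and the addresses keep changing, one thing that could be checked is whether the banned IPs cluster in a few networks. The sketch below is an assumption-laden illustration, not advice from the thread: it handles IPv4 addresses only, the /24 grouping is an arbitrary choice, and `group_by_network` is a hypothetical helper name.

```python
import ipaddress
from collections import defaultdict


def group_by_network(ips, prefix_len=24):
    """Group IPv4 address strings by their enclosing network of prefix_len bits."""
    groups = defaultdict(list)
    for ip in ips:
        # strict=False lets a host address stand in for its network.
        net = ipaddress.ip_network(f"{ip}/{prefix_len}", strict=False)
        groups[str(net)].append(ip)
    return dict(groups)
```

If several of the day's banned addresses fall into the same group, blocking that range once may save repeated single-IP bans; if they are scattered, this approach will not help.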

Top


Moderator:  Gizmo 