Site Links
Home
Features
Documentation
Pricing & Order
Members Area
Support Options
UBBDev.com
UBBWiki.com
Who's Online
2 registered (SteveS, Gizmo), 33 Guests and 15 Spiders online.
Key: Admin, Global Mod, Mod
Featured Member
Registered: 06/07/07
Posts: 4
Top Posters (30 Days)
Ruben 51
Gizmo 24
DennyP 24
Dunny 15
SteveS 14
AllenAyres 12
dbremer 10
SD 10
drkknght00 9
doug 8
Latest Photos
OK Corral Shoot Out
Testing
Basildon Train Station
Basildon Town Centre looking from the rounderbout
Basildon Town Square
Page 1 of 2 1 2 >
Topic Options
#191359 - 07/16/07 07:37 PM Yahoo pounds our board continuously? Anyone else?
Architecht Offline
journeyman
Registered: 06/06/06
Posts: 86
Since installing 7x and the spider tracking, I've noticed that Yahoo is all over our boards all of the time.

Our Who's Online

What gives? I am assuming something is wrong or yahoo wouldn't be doing this 24 hours / day.
_________________________
Top
Express Hosting
Express Hosting "We are the official hosting company of UBB.threads. Ask us about our free migration services to migrate your UBB.threads installation."
#191367 - 07/16/07 07:54 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: Architecht]
Gizmo Online   cat

Registered: 06/05/06
Posts: 14995
Loc: Portland, OR; USA
I have 4 communities, and I can view them at several other sites; it's normal; Google used to do this a lot as well, but it seems they streamlined how they crawl data
_________________________
Forums: UGN Security & VNC Web Design & Development
UBB.Threads: UBB.Wiki, My UBBSkins, UBB.Sitemaps
Longtime UBB Supporter, UBB Beta Tester & Resident Post-A-Holic.
UBB Modifications, Styling, Coding Services, Disaster Recovery, and more!
Top
#191487 - 07/17/07 10:22 AM Re: Yahoo pounds our board continuously? Anyone else? [Re: Gizmo]
Architecht Offline
journeyman
Registered: 06/06/06
Posts: 86
I dunno... it seems weird. I agree that it's a quirk in the yahoo spider, but it's probably interacting with something on the boards to loop continuously over the same junk?

That's a performance hit I could do without if we could track it down.

Has anyone analyzed their logs to see what the yahoo spiders are actually doing? Is it legit traffic or are they caught somehow?
_________________________
Top
#191520 - 07/17/07 01:07 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: Architecht]
AllenAyres Offline

Registered: 12/29/03
Posts: 1995
Loc: Texas
They show up in the WOL data reading different topics, forums, etc. It doesn't seem like they're 'stuck', it does seem like yahoo just sends a plethora of them out daily tho, one right after another.
_________________________
- Allen
- ThreadsDev | PraiseCafe
Top
#191537 - 07/17/07 02:04 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: AllenAyres]
Architecht Offline
journeyman
Registered: 06/06/06
Posts: 86
Hm.

Wonder if they're somehow getting an error on their end parsing what they've collected and just go back to retry the next day. \:\(
_________________________
Top
#191608 - 07/17/07 08:48 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: Architecht]
Gizmo Online   cat

Registered: 06/05/06
Posts: 14995
Loc: Portland, OR; USA
I see seperate IP's on differant pages; so it's not that they're getting "stuck" it's just that they're sending a lot of bots out... It shouldn't effect anything too much (50 bots don't take up as much resources as you'd think).

There are some robots.txt rules to keep bots out of "un-needed spots" (such as the calendar, where they'll incriment day by day into oblivian).
_________________________
Forums: UGN Security & VNC Web Design & Development
UBB.Threads: UBB.Wiki, My UBBSkins, UBB.Sitemaps
Longtime UBB Supporter, UBB Beta Tester & Resident Post-A-Holic.
UBB Modifications, Styling, Coding Services, Disaster Recovery, and more!
Top
#191978 - 07/19/07 01:18 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: Gizmo]
Architecht Offline
journeyman
Registered: 06/06/06
Posts: 86
50 bots would be nice. Right now I have 171 bots on my site, by far most of them are yahoo. And this is ALL The time.

http://boards.collectors-society.com/ubbthreads.php?ubb=online

There's just got to be something wrong with that.


Edited by Architecht (07/19/07 01:18 PM)
_________________________
Top
#191997 - 07/19/07 04:22 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: Architecht]
ntdoc Offline
Registered: 11/09/06
Posts: 3384
Well even here on UBB they were up to about 700 bots at one time.
Top
#192020 - 07/19/07 06:16 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: ntdoc]
teamzr1 Offline
enthusiast
Registered: 06/19/07
Posts: 249
MSN, Yahoo and Google are always on my forum and I do not like it.
They in fact should pay all of us for they are getting our content for free to users they are charging a fee to use their system and the money they make from vendors who pay them.

If they do not pay then there should be some methods we use to block them as I also get some koint out of japan that also sucks onto our content.

I have never made a penny from anyone coming via those ISPs
_________________________
JR
Team ZR-1 Corvette Racer's
Top
#192037 - 07/19/07 07:56 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: teamzr1]
Gizmo Online   cat

Registered: 06/05/06
Posts: 14995
Loc: Portland, OR; USA
Actually, I feel the opposite, I feel we shoudl pay them... Think of it, they download our pages for their DB, their users search their database and they send their users to our site, which tend to register and click advertising links which in turn make us money... For those of us who advertise anyway...

Now, if you want to stop them from visiting your site at all, you can, it's what robots.txt is for, just stop them from visiting your forums and never worry about them again (though you'll soon notice traffic decreases, and some sites depend on search engines for new users)
_________________________
Forums: UGN Security & VNC Web Design & Development
UBB.Threads: UBB.Wiki, My UBBSkins, UBB.Sitemaps
Longtime UBB Supporter, UBB Beta Tester & Resident Post-A-Holic.
UBB Modifications, Styling, Coding Services, Disaster Recovery, and more!
Top
#192058 - 07/19/07 08:35 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: Gizmo]
Architecht Offline
journeyman
Registered: 06/06/06
Posts: 86

I don't mind them being there, I'm just assuming that all the thrashing going on indicates something bad as it's seems inefficient on its face.
_________________________
Top
#192067 - 07/19/07 10:43 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: Architecht]
Gizmo Online   cat

Registered: 06/05/06
Posts: 14995
Loc: Portland, OR; USA
Well, there are some "black holes", such as the calendar, where they can go up day by day into oblivion, but that's an easy fix with robots.txt:
User-agent: *
Disallow: /forum/ubbthreads.php?ubb=calendar
Disallow: /forum/ubbthreads.php/ubb/calendar
Disallow: /forum/ubbthreads.php?ubb=showday
Disallow: /forum/ubbthreads.php/ubb/showday
_________________________
Forums: UGN Security & VNC Web Design & Development
UBB.Threads: UBB.Wiki, My UBBSkins, UBB.Sitemaps
Longtime UBB Supporter, UBB Beta Tester & Resident Post-A-Holic.
UBB Modifications, Styling, Coding Services, Disaster Recovery, and more!
Top
#192080 - 07/19/07 11:47 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: Gizmo]
teamzr1 Offline
enthusiast
Registered: 06/19/07
Posts: 249
I never in almost 10 years with a forum online have had one person who registered by having gone through one of those ISPs.
In any case what they are charging their customers as a product is our content we not only produce but pay for the domain and webhosting costs


 Originally Posted By: Gizmo
Actually, I feel the opposite, I feel we shoudl pay them... Think of it, they download our pages for their DB, their users search their database and they send their users to our site, which tend to register and click advertising links which in turn make us money... For those of us who advertise anyway...

Now, if you want to stop them from visiting your site at all, you can, it's what robots.txt is for, just stop them from visiting your forums and never worry about them again (though you'll soon notice traffic decreases, and some sites depend on search engines for new users)
_________________________
JR
Team ZR-1 Corvette Racer's
Top
#192081 - 07/19/07 11:50 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: teamzr1]
Gizmo Online   cat

Registered: 06/05/06
Posts: 14995
Loc: Portland, OR; USA
You've never had a user who's come to your site through a search engine and registered? Somehow I find that hard to believe, unless theres some really un-searched for content on your site...

IF you want to block SE's all together, just add this to your robots.txt:
User-agent: *
Disallow: /

They'll never touch your site again (so long as they follow the robots.txt standard, which most major ones do)

Honestly though, the max BW i've ever seen wasted by SE's in a month is about a gig; and this was on a huge site with loads of content that bankrolls about 4k+ a month due to advertising and depends on search engines...
_________________________
Forums: UGN Security & VNC Web Design & Development
UBB.Threads: UBB.Wiki, My UBBSkins, UBB.Sitemaps
Longtime UBB Supporter, UBB Beta Tester & Resident Post-A-Holic.
UBB Modifications, Styling, Coding Services, Disaster Recovery, and more!
Top
#193638 - 08/02/07 01:21 AM Re: Yahoo pounds our board continuously? Anyone else? [Re: Architecht]
Joe Siegler Offline
newbie
Registered: 12/30/03
Posts: 30
Loc: Garland, TX
 Originally Posted By: Architecht
50 bots would be nice. Right now I have 171 bots on my site, by far most of them are yahoo. And this is ALL The time.

http://boards.collectors-society.com/ubbthreads.php?ubb=online

There's just got to be something wrong with that.


Yahoo is a pig. Googlebot and the others are nowhere near as bad as Yahoo is. I've been doing this since 1995, so I have some idea of what I'm talking about. \:\)

Right now I have 4 registered users on, 6 guests, and 135 spiders. Almost all of them are Yahoo from various IP addresses - as was pointed out that's not one spider stuck, that's a ton of them. There's no reason why they should be that friggin piggish.

Makes me wonder if I shouldn't do something like this to them.

However, Yahoo's own help has some ideas:

http://help.yahoo.com/l/us/yahoo/search/webcrawler/slurp-03.html
_________________________
Joe Siegler - Webmaster
3D Realms & Black Sabbath Online
Top
#193639 - 08/02/07 02:08 AM Re: Yahoo pounds our board continuously? Anyone else? [Re: Joe Siegler]
Gizmo Online   cat

Registered: 06/05/06
Posts: 14995
Loc: Portland, OR; USA
I will agree that the crawler delay option they mention would be a valid way to slow down pounding; thanks for the link Joe :).

There are some areas of the UBB that spiders will get stuck in (as mentioned in the faq), the calendar is one of them, as are member files (neither of which need to be crawled).
_________________________
Forums: UGN Security & VNC Web Design & Development
UBB.Threads: UBB.Wiki, My UBBSkins, UBB.Sitemaps
Longtime UBB Supporter, UBB Beta Tester & Resident Post-A-Holic.
UBB Modifications, Styling, Coding Services, Disaster Recovery, and more!
Top
#193756 - 08/03/07 11:31 AM Re: Yahoo pounds our board continuously? Anyone else? [Re: Gizmo]
Joe Siegler Offline
newbie
Registered: 12/30/03
Posts: 30
Loc: Garland, TX
Look at this.

http://www.black-sabbath.com/forums/ubbthreads.php?ubb=online

Right now I have 3 users, 10 guests, and 162 friggin search spiders. The overhelming majority are Yahoo. I implemented the delay option, it didn't seem to make much of a difference. \:\(

My Texas Rangers site isn't nearly as bad. 1 user (me), 0 guests, and 9 spiders (all but one are Yahoo). Sigh.

http://www.rangerfans.com/forums/ubbthreads.php?ubb=online

Unless I did it wrong, but I don't think so.
_________________________
Joe Siegler - Webmaster
3D Realms & Black Sabbath Online
Top
#193773 - 08/03/07 07:27 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: Joe Siegler]
Gizmo Online   cat

Registered: 06/05/06
Posts: 14995
Loc: Portland, OR; USA
Lol, 160 yahoo spiders is nothing; I had 500 the one night lol
_________________________
Forums: UGN Security & VNC Web Design & Development
UBB.Threads: UBB.Wiki, My UBBSkins, UBB.Sitemaps
Longtime UBB Supporter, UBB Beta Tester & Resident Post-A-Holic.
UBB Modifications, Styling, Coding Services, Disaster Recovery, and more!
Top
#193785 - 08/03/07 11:53 PM Re: Yahoo pounds our board continuously? Anyone else? [Re: Gizmo]
teamzr1 Offline
enthusiast
Registered: 06/19/07
Posts: 249
I added a delay to robots.txt
its in the root oy my domain

User-agent: *
Disallow: /cgi-bin/

User-agent: Slurp
Crawl-delay: 5

Has not slowed down Yahoo slurp ( which must mean they are saying they want to suck up everyones content ) one bit
_________________________
JR
Team ZR-1 Corvette Racer's
Top
#193791 - 08/04/07 05:52 AM Re: Yahoo pounds our board continuously? Anyone else? [Re: teamzr1]
Gizmo Online   cat

Registered: 06/05/06
Posts: 14995
Loc: Portland, OR; USA
Well, they don't check robots.txt every time they request something; they request it at a set interval that can take up to a few weeks to pass.
_________________________
Forums: UGN Security & VNC Web Design & Development
UBB.Threads: UBB.Wiki, My UBBSkins, UBB.Sitemaps
Longtime UBB Supporter, UBB Beta Tester & Resident Post-A-Holic.
UBB Modifications, Styling, Coding Services, Disaster Recovery, and more!
Top
Page 1 of 2 1 2 >



Moderator:  AllenAyres, Harold, Ian, Ron M 
Shout Box

Today's Birthdays
No Birthdays
Recent Topics
Temporary Password email not being received
by
Yesterday at 10:02 PM
Ability to "like" individual posts (not Facebook "likes)
by doug
05/23/12 09:03 AM
Island Permissions
by ThreadsUser
05/22/12 03:03 PM
streaming video
by prkrgrp
05/20/12 07:02 PM
New Posts Corrupted? Can someone help?
by PianoWorld
05/19/12 09:41 AM
Forum Stats
10489 Members
36 Forums
33841 Topics
181707 Posts

Max Online: 978 @ 06/24/07 11:19 PM
Random Image