#197818 - 09/20/07 11:59 PM Twiceler, how I loathe thee...
Gizmo Moderator Offline

***

Registered: 06/04/06
Posts: 12007
Loc: Portland, OR; USA
Well, I've gone the last 3 days with UGN down. The culprit? The constant indexing and re-indexing from my supposed little "friend" Twiceler...

Twiceler is the crawler for a supposed "new search engine" which just doesn't exist... why it has to constantly reindex I have no clue, but it does nothing more than DEVOUR my bandwidth...

It was OK for a while, just a little here and a little there... Then I started going in and looking at my weblogs... 5+ GB of unknown bandwidth usage that had to be cut down... And what better time to dig into it than while being held offline by a flood of traffic...

I finally get back online and check the resources in current use... There's my "friend" Twiceler... with 8 threads just churning away...

You all know me, the guy who says "well, if they have the money for this many servers they HAVE to be making something good, just wait it out"... Tonight, I'm no longer the passive voice of reason; Twiceler is a waste of my time, resources, and money.

Ways of BANNING Twiceler.

robots.txt is the quickest way to ban robots, BUT not all of them re-check it immediately; a bot can keep working from its cached copy for several days, weeks, or even months. So if it can wait, oh well, here you go...

Robots.txt
 Code:
User-agent: twiceler
Disallow: /
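
If you want to confirm the rule reads the way you intend, you can feed it to a parser before waiting on the bot. This is a minimal sketch using Python's standard urllib.robotparser. It only shows how a well-behaved parser interprets the file, not what Twiceler itself will do; the path and the Googlebot comparison are assumptions picked for illustration.

```python
from urllib.robotparser import RobotFileParser

# The same two lines as the robots.txt rule above.
ROBOTS_TXT = """\
User-agent: twiceler
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Hypothetical path, used only for illustration.
PATH = "/ubbthreads.php"

# Agent names are matched case-insensitively by this parser.
twiceler_allowed = parser.can_fetch("Twiceler", PATH)
# No rule mentions this agent, so it stays allowed.
googlebot_allowed = parser.can_fetch("Googlebot", PATH)

print("Twiceler allowed:", twiceler_allowed)
print("Googlebot allowed:", googlebot_allowed)
```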


.htaccess (Apache; you should also be able to define this in your httpd.conf)
Here, we ban their IPs! These class C ranges are actually published on their website, so we can easily ban them with 3 deny lines (note: you only need to turn the rewrite engine "on" ONCE in your .htaccess file, and the deny lines themselves don't depend on it)
 Code:
RewriteEngine on
# Deny users IP's #
order allow,deny
deny from 38.99.13.
deny from 64.1.215.
deny from 208.36.144.
allow from all


Ban the U/A in your .htaccess
These lines will have your web server refuse the user agent before it can even access your site.
 Code:
RewriteEngine on
# Block Bad Bots #
# The last RewriteCond must not carry [OR]; a trailing [OR] makes the rule match every request.
RewriteCond %{HTTP_REFERER} cuill\.com [NC,OR]
RewriteCond %{HTTP_USER_AGENT} Twiceler [NC]
RewriteRule .* - [F,L]


3 ways to show our "good friend" Twiceler some good old-fashioned "hard love"...
_________________________
UGN Security, Elite Web Gamers & VNC Web Design Owner
Longtime UBB Supporter, UBB7 Beta Tester & Resident Post-A-Holic

#197830 - 09/21/07 09:47 AM Re: Twiceler, how I loathe thee... [Re: Gizmo]
AllenAyres Offline

****

Registered: 12/29/03
Posts: 1639
Loc: Texas
Can you send the bots back to cuill.com? Hopefully we can get it to fold in on itself.
_________________________
- Allen
- ThreadsDev | PraiseCafe

#197861 - 09/21/07 03:50 PM Re: Twiceler, how I loathe thee... [Re: AllenAyres]
Gizmo Moderator Offline

***

Registered: 06/04/06
Posts: 12007
Loc: Portland, OR; USA
Eh, you can send a redirect, but I'd bet that their bots are configured to ignore their own homepage ;)
#197875 - 09/21/07 09:14 PM Re: Twiceler, how I loathe thee... [Re: Gizmo]
AllenAyres Offline

****

Registered: 12/29/03
Posts: 1639
Loc: Texas
Hmmm... we need to create a black hole web site to forward them to: a site that only uses the images and CSS stylesheets from cuill.com and just sends the bot back and forth over 2-3 pages, with the page content changing just barely every time it's accessed. The bot would think there were billions of pages to index. ;)
#197879 - 09/22/07 12:02 AM Re: Twiceler, how I loathe thee... [Re: AllenAyres]
Gizmo Moderator Offline

***

Registered: 06/04/06
Posts: 12007
Loc: Portland, OR; USA
Eh, that would waste your own bandwidth :/
#198015 - 09/24/07 06:03 AM Re: Twiceler, how I loathe thee... [Re: Gizmo]
Mark S Offline
Carpal Tunnel
***

Registered: 07/04/06
Posts: 4044
Loc: Liverpool : England : UK
I know you guys are trying to block it, etc.
But as described in my post, I just sent them an e-mail
and it never came back again ;)

Click Me
_________________________
Version v7.2.2 smile smile < Threads satisfaction status
People who inspire me Rick Gizmo Ian David jgeoff ntdoc
To answer the question you must first give a question.

#198026 - 09/24/07 09:48 AM Re: Twiceler, how I loathe thee... [Re: Mark S]
Rick Administrator Offline

*****

Registered: 06/04/06
Posts: 7904
Loc: Aberdeen, WA
This topic has been getting a lot of visits from various search engines. The topic is only 4 days old, and looking in the referer logs there are at least 100 hits from people coming from Google or Yahoo.
_________________________
UBB.threads™ Developer
My Personal Website · StogieSmokers.com

#198049 - 09/24/07 06:39 PM Re: Twiceler, how I loathe thee... [Re: Rick]
Gizmo Moderator Offline

***

Registered: 06/04/06
Posts: 12007
Loc: Portland, OR; USA
If you simply search for Twiceler issues you'll find plenty of them... I have heard of plenty of users asking to have their sites excluded, and the operators either just don't care or the bot returns after a while; so I decided to take a more direct approach and force it to never return...

And I'm glad to hear of the popularity! lol...
#216291 - 08/07/08 11:21 AM Re: Twiceler, how I loathe thee... [Re: Gizmo]
ScriptKeeper Offline
old hand
***

Registered: 12/09/06
Posts: 1039
Loc: UK
Update on this. Maybe you shouldn't block this any more as it is being used by a new Google rival - Cuil.

(Thread)


#216294 - 08/07/08 11:43 AM Re: Twiceler, how I loathe thee... [Re: ScriptKeeper]
driv Offline
Pooh-Bah
****

Registered: 01/10/04
Posts: 1703
Loc: Essex, UK
Google has a rival?
Bigger, better?

So was Betamax.... ;)
_________________________
Oi Oi Saveloy!
(Courtesy of Sd - well known Anglophile...!?!)
My True star rating wink

#216310 - 08/08/08 01:12 AM Re: Twiceler, how I loathe thee... [Re: driv]
Gizmo Moderator Offline

***

Registered: 06/04/06
Posts: 12007
Loc: Portland, OR; USA
Cuil was created by some former Google engineers; I've yet to really use it, but I'll dig around later.

BTW, I banned them due to mass abuse/bandwidth usage in the first place lol


Edited by Gizmo (08/08/08 01:12 AM)
#216324 - 08/08/08 01:12 PM Re: Twiceler, how I loathe thee... [Re: Gizmo]
David Dreezer Offline
Pooh-Bah
*****

Registered: 07/21/06
Posts: 1792
Some of our customers have banned Twiceler because it has brought their sites down. There is just no need to abuse a forum the way they do. It's almost like a DoS attack the way they hit you so hard.
_________________________
What do you mean "You're the bomb, run away?"

#216333 - 08/08/08 03:57 PM Re: Twiceler, how I loathe thee... [Re: David Dreezer]
Gizmo Moderator Offline

***

Registered: 06/04/06
Posts: 12007
Loc: Portland, OR; USA
Originally Posted By: David Dreezer
"It's almost like a DoS attack the way they hit you so hard."
My point exactly! It was crazy when I was hit by them: resources dropped to near nothing, and all I saw was them and Yahoo on my forums lol
#216389 - 08/11/08 03:06 PM Re: Twiceler, how I loathe thee... [Re: Gizmo]
FordDoctor Offline
journeyman

Registered: 06/05/06
Posts: 92
Loc: New Joisey!
Stupid question... or maybe a good one for those of us who are not as skilled and knowledgeable as you Gents:

How do you know if your site is being abused by Twiceler?
_________________________
Ford diesel master technician by day...
Webmaster by night! cool
FordDoctorsDTS.com running UBB Threads 7.0.2

#216392 - 08/11/08 03:29 PM Re: Twiceler, how I loathe thee... [Re: FordDoctor]
driv Offline
Pooh-Bah
****

Registered: 01/10/04
Posts: 1703
Loc: Essex, UK
A good question indeed :)

You only need to click on 'Who's Online' to be able to see a list of spiders etc. :)
#216394 - 08/11/08 04:12 PM Re: Twiceler, how I loathe thee... [Re: driv]
FordDoctor Offline
journeyman

Registered: 06/05/06
Posts: 92
Loc: New Joisey!
Great! But I am still running version 7.0.2, which would explain why my "who's on-line" does not show spiders. :blush:
#216398 - 08/11/08 05:11 PM Re: Twiceler, how I loathe thee... [Re: FordDoctor]
Gizmo Moderator Offline

***

Registered: 06/04/06
Posts: 12007
Loc: Portland, OR; USA
WOL (Who's Online) + Google Analytics + AWStats led me to notice the abuse, as they were hammering the hell out of the server...

Additionally, I log the IP and hostname of everything accessing the server; Twiceler was listed constantly.

It all added up to me ending up banning the hell out of them
#216447 - 08/13/08 11:10 AM Re: Twiceler, how I loathe thee... [Re: Gizmo]
David Dreezer Offline
Pooh-Bah
*****

Registered: 07/21/06
Posts: 1792
AWStats, Webalizer, any log parser.
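
If you'd rather not install a stats package, a few lines of script can answer the same question. Here is a minimal Python sketch, assuming your server writes the Apache combined log format (user agent as the last quoted field on each line). The sample lines, the user-agent strings in them, and the hits_by_ip name are all made up for illustration.

```python
import re
from collections import Counter

# In the combined log format the user agent is the last quoted field.
UA_PATTERN = re.compile(r'"([^"]*)"\s*$')


def hits_by_ip(lines, bot_name):
    """Count requests per client IP whose user agent mentions bot_name."""
    needle = bot_name.lower()
    hits = Counter()
    for line in lines:
        match = UA_PATTERN.search(line)
        if match and needle in match.group(1).lower():
            # The client IP is the first space-separated field.
            client_ip = line.split(" ", 1)[0]
            hits[client_ip] += 1
    return hits


# Invented sample lines; in practice pass an open access log file instead.
SAMPLE_LINES = [
    '38.99.13.5 - - [18/Sep/2008:22:01:00 -0600] "GET /a HTTP/1.1" 200 512 "-" "Twiceler (sample UA)"',
    '38.99.13.5 - - [18/Sep/2008:22:01:01 -0600] "GET /b HTTP/1.1" 200 734 "-" "Twiceler (sample UA)"',
    '38.99.44.7 - - [18/Sep/2008:22:01:01 -0600] "GET /c HTTP/1.1" 200 128 "-" "Twiceler (sample UA)"',
    '192.0.2.10 - - [18/Sep/2008:22:01:02 -0600] "GET /a HTTP/1.1" 200 512 "-" "ExampleBrowser/1.0"',
]

# Busiest addresses first.
for ip, count in hits_by_ip(SAMPLE_LINES, "twiceler").most_common():
    print(ip, count)
```

Any address that shows up here but is not covered by your deny list is a candidate for a new range.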
#217131 - 09/18/08 10:45 PM Re: Twiceler, how I loathe thee... [Re: David Dreezer]
bakerzdosen Offline
stranger

Registered: 09/18/08
Posts: 14
Loc: Utah
I hate for something like this to be my first post, but alas...

What are the chances of Twiceler/Cuil causing our site to basically come to its knees?

Usually, everything is fine, but tonight, suddenly, mysqld is consuming a LOT of CPU time, and it's not going away. (In a dual proc setup, it's going from 98% to 160% CPU.) There were TONS of httpd procs/connections open. I initially thought it was an index problem, but once I saw this thread, I figured I'd look at the access logs, and sure enough... there were a LOT of Twiceler requests in the timeframe of the slowdown.

For us, it's not a bandwidth issue, but a CPU issue. Has that been the case for anyone else or should I pursue my index theory?
_________________________
bmwsporttouring.com

#217136 - 09/19/08 03:16 AM Re: Twiceler, how I loathe thee... [Re: bakerzdosen]
ScriptKeeper Offline
old hand
***

Registered: 12/09/06
Posts: 1039
Loc: UK
Well, you could try adding a Crawl-delay to your robots.txt, which should reduce how often the robot requests pages (for robots that honor the directive).

Example:

10 seconds:

Code:
User-agent: Twiceler
Crawl-delay: 10


2 minutes:

Code:
User-agent: Twiceler
Crawl-delay: 120
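
To check that the delay is picked up for the right agent, the same file can be read back with a parser. This is a minimal sketch using crawl_delay from Python's standard urllib.robotparser (available since Python 3.6). It shows how a compliant parser reads the directive and says nothing about whether Twiceler honors it; the Googlebot lookup is only there as a contrast.

```python
from urllib.robotparser import RobotFileParser

# The 2-minute example from above.
ROBOTS_TXT = """\
User-agent: Twiceler
Crawl-delay: 120
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Delay in seconds for the named agent, or None if no rule applies.
twiceler_delay = parser.crawl_delay("Twiceler")
other_delay = parser.crawl_delay("Googlebot")

# A crawl delay slows a robot down; it does not block any URL.
still_allowed = parser.can_fetch("Twiceler", "/")

print("Twiceler delay:", twiceler_delay)
print("Googlebot delay:", other_delay)
print("Twiceler still allowed:", still_allowed)
```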

#217138 - 09/19/08 04:26 AM Re: Twiceler, how I loathe thee... [Re: ScriptKeeper]
Gizmo Moderator Offline

***

Registered: 06/04/06
Posts: 12007
Loc: Portland, OR; USA
I doubt they'll respect it... they aren't kind at ALL...
#217152 - 09/19/08 11:09 AM Re: Twiceler, how I loathe thee... [Re: Gizmo]
bakerzdosen Offline
stranger

Registered: 09/18/08
Posts: 14
Loc: Utah
Well, I tried the IP banning approach. We'll see how it goes.

One thing though: we're on 7.2.2, and at one point I was seeing a bunch of Twiceler requests in the access log, but I don't see anything other than Google, MSN, and Yahoo in the spider list on the Who's Online page. Is this because our version is slightly older, or does Cuil do something different so that it simply shows up as an anonymous user?

Also, I noticed from our logs that they're using the 38.99.44.x subnet as well, so I added that.
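
To double-check that every address seen in the logs is covered by the deny list, the ranges can be tested offline. Here is a minimal Python sketch using the standard ipaddress module, treating each "deny from a.b.c." prefix as a /24 network. The is_banned helper and the test addresses are hypothetical.

```python
import ipaddress

# The three ranges from the first post plus the 38.99.44.x subnet noted above.
BANNED_NETWORKS = [
    ipaddress.ip_network("38.99.13.0/24"),
    ipaddress.ip_network("64.1.215.0/24"),
    ipaddress.ip_network("208.36.144.0/24"),
    ipaddress.ip_network("38.99.44.0/24"),
]


def is_banned(ip_text):
    """Return True if ip_text falls inside any banned /24 range."""
    address = ipaddress.ip_address(ip_text)
    return any(address in network for network in BANNED_NETWORKS)


# Hypothetical addresses: two inside banned ranges, two outside.
for candidate in ["38.99.44.17", "208.36.144.200", "38.99.45.1", "192.0.2.1"]:
    print(candidate, "banned" if is_banned(candidate) else "not banned")
```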
#217155 - 09/19/08 04:33 PM Re: Twiceler, how I loathe thee... [Re: bakerzdosen]
Gizmo Moderator Offline

***

Registered: 06/04/06
Posts: 12007
Loc: Portland, OR; USA
Well, the user agent needs to be added in the CP (control panel); otherwise the search engine will show up as an anonymous user...

There is a thread here somewhere showing different UA strings that several of us worked on... I really should sticky it, wherever it is...
#217162 - 09/20/08 09:39 AM Re: Twiceler, how I loathe thee... [Re: Gizmo]
Mike L Offline
stranger

Registered: 06/05/06
Posts: 24
Gizmo,

Allow me to make a very small contribution.

Here are two relevant threads. Perhaps one of these is the one you were thinking of.

UA Strings 1

UA Strings 2

