 |
 |
 |
 |
Registered: 06/05/06
Posts: 709
|
|
|
 |
 |
 |
 |
|
 |
 |
 |
 |
#221617 - 01/26/09 02:56 PM
Re: Search engine
[Re: Gizmo]
|
stranger
|
Registered: 02/16/06
Posts: 6
Loc: North-Central Texas
|
|
Hey Gizmo- You seem to be at the leading edge of search engine promotion with respects to the UBB Threads board and generating a sitemap for it. I am an ASP programmer and had already written code a few years ago that used the File Scripting Object to iterate through all the flat files generated by the old UBB and generate URL's to all the posts and topics for inclusion in my sitemap. We switched to Threads 7.2 several months ago, and I just now got around to modifying my sitemap generator to include the Threads-based URLs. I basically just wrote code that uses ADO to open the MySQL database and iterate through the ubbt_topics, ubbt_posts, and ubbt_users tables to generate all the possible URLs. I'm sure the resulting XML that is written is correctly formatted and valid, but much to my dismay, after submitting the newly generated sitemap to Google, they aren't indexing ANY of the URLs! Did you have any problems with Google not indexing the URLs your sitemap generator generates? Did it just take a while for them to get around to indexing them? The sitemap in question doesn't just include generated URLs for the Threads board, but also from our inventory database and other sources. If you want to see what my URL's look like, it is located at http://www.myfootshop.com/myfootshopcom-sitemap.xml It includes over 33,000 URLs, but the first ones in the XML file are the Threads URLs.
|
|
Top
|
|
|
|
|
 |
 |
 |
 |
 |
 |
 |
 |
|
Express Hosting
"We are the official hosting company of UBB.threads. Ask us about our free migration services to migrate your UBB.threads installation."
|
|
|
 |
 |
 |
 |
 |
 |
 |
 |
#221618 - 01/26/09 03:34 PM
Re: Search engine
[Re: embryo]
|
|
Registered: 06/05/06
Posts: 14995
Loc: Portland, OR; USA
|
|
Well, theres a couple of hints for you... You should limit your sitemap to 25k urls per page, use a sitemap index if you need to go over.
Theres no need to have your users or posts in a sitemap, all you need is the topic link as the posts will all go here, you'll just be feeding it more data than it needs.
A sitemap won't get all of your stuff crawled overnight; you're simply aiding the SE's discovery bot in finding new content, it can still take months for all of that yummy content to get indexed.
A huge sitemap will cause strain on your server, the sitemap index will definately help ease requests, especially if the sitemap itself isn't cached and builds live, you'll be putting strain on the sql server for having it push out all possible urls.
Also, try avoiding duplicate URLs, once it has a url it'll crawl all possible urls on that page, feeding it say 5 possible urls to the same page is simply going to bog down the request time...
|
|
Top
|
|
|
|
|
 |
 |
 |
 |
|
|