crawling problem
« on: January 03, 2008, 03:41:10 AM »
Hi,

I  had the Standalone XML Sitemap Generator and would like to ask how to fix this problem because every time i run the crawling it generate in the sitemap the 2 different link of that same folder one with index.html and the other one is without index.html, for example this url :
[ External links are visible to forum administrators only ] and the other one is  [ External links are visible to forum administrators only ]

 ??? which is wrong is should only generate only 1 link not 2

THANKS,
CASPER
 

Re: crawling problem
« Reply #1 on: January 04, 2008, 12:55:39 AM »
Hello,

you should modify your site to only include one of the links, either "directory/" or "directory/index.html". If you use both of them, Sitemap Generator will find them too.
Re: crawling problem
« Reply #2 on: February 28, 2008, 02:00:04 PM »
hello,

is there another way? I would hate to have to search my site to find all the occurences.

So is there a way to tell the script to ignore the link without index?

Like accepting only url with .php .html and .htm?

thank you in advance

nicolas
Re: crawling problem
« Reply #3 on: February 28, 2008, 11:44:00 PM »
That would be the only reliable way to avoid indexing of both URLs, since search engines will be able to find both links anyway, even if you will include only one in sitemap.