I'm only getting one page indexed in my sitemap
« on: May 08, 2008, 09:39:50 PM »
Hi

I seem to have the same problem as a lot of people - only my home page is indexed.

I have tried this script on another site - very similar to tis site and it runs no problem - well I say no problem it runs till it runs out of memory.

What could me problem be please?


Ian
Re: I'm only getting one page indexed in my sitemap
« Reply #1 on: May 09, 2008, 12:12:16 PM »
Same here  >:(
Re: I'm only getting one page indexed in my sitemap
« Reply #2 on: May 09, 2008, 01:33:14 PM »
Hello,

in MOST cases the problem is with specifying Starting URL in sitemap generator options.
Please open your site and check what is the exact domain name in your links, if it is:
http://www.domain.com/page_url.html
then Starting URL must be with www prefix:
Code: [Select]
http://www.domain.com/
If your links look like http://domain.com/page_url.html
then Starting URL must be without ww prefix:
Code: [Select]
http://domain.com/
Please double check that and let me know if that works.
Re: I'm only getting one page indexed in my sitemap
« Reply #3 on: May 09, 2008, 01:42:06 PM »
Hi Oleg,

I've tried all permutations of it now:

with www and without
with http:// and without
also a fully qualified URL direct to my index page

I can PM you the URL to the generator folder if you like so you can take a look

BTW I changed the name of the generator folder, would that make a difference?

Thanks for the reply

Ian Blackford
Re: I'm only getting one page indexed in my sitemap
« Reply #4 on: May 09, 2008, 01:55:38 PM »
Some more info,

When I hit the crawl button, I get a flash of text then I get presented with the page where I can view the sitemap, after a few attempts I got the page to stop and I as able to copy out the text.

It's an error report and looks like this:

With .gz option ON:

Links depth: -
Current page: -
Pages added to sitemap: -
Pages scanned: - (- Kb)
Pages left: - (+ - queued for the next depth level)
Time passed: -
Time left: -
Memory usage: -
Completed
Total pages indexed: 0
Calculating changelog...
Warning: gzwrite(): supplied argument is not a valid stream resource in /www/site/generator/pages/class.xml-creator.inc.php(2) : eval()'d code on line 185

Warning: gzclose(): supplied argument is not a valid stream resource in /www/site/generator/pages/class.xml-creator.inc.php(2) : eval()'d code on line 186

Creating HTML sitemap...
Warning: ksort() expects parameter 1 to be array, null given in /www/site/generator/pages/class.html-creator.inc.php(2) : eval()'d code on line 49

Warning: Invalid argument supplied for foreach() in /www/site/generator/pages/class.html-creator.inc.php(2) : eval()'d code on line 51

With the .gz option OFF:

Warning: fwrite(): supplied argument is not a valid stream resource in /www/site/generator/pages/class.xml-creator.inc.php(2) : eval()'d code on line 185

Warning: fclose(): supplied argument is not a valid stream resource in /www/site/generator/pages/class.xml-creator.inc.php(2) : eval()'d code on line 186

Creating HTML sitemap...
Warning: ksort() expects parameter 1 to be array, null given in /www/site/generator/pages/class.html-creator.inc.php(2) : eval()'d code on line 49

Warning: Invalid argument supplied for foreach() in /www/site/generator/pages/class.html-creator.inc.php(2) : eval()'d code on line 51



I don't know if that is the full error report sorry - but I hope it helps to throw some light on the problem.

Thanks again

Ian
Re: I'm only getting one page indexed in my sitemap
« Reply #5 on: May 09, 2008, 07:54:15 PM »
I think you should rename that folder back to /generator/ because the php script is trying to use it.


Nermi

Re: I'm only getting one page indexed in my sitemap
« Reply #6 on: May 09, 2008, 08:32:58 PM »
Hi

Thanks for the reply, but I don't think that's the case. I altered the paths you see there so as not to expose my domain name or the new name of the generator folder.

Sorry for the confusion.

Ian
Re: I'm only getting one page indexed in my sitemap
« Reply #7 on: May 09, 2008, 08:53:32 PM »
Will see what Oleg says, but I still have a feeling that might be a problem.
And you can always protect your files with password instead of changing the folder name.
But who knows, maybe I am wrong. Been wrong on many things lately.  ;)

Nermi
Re: I'm only getting one page indexed in my sitemap
« Reply #8 on: May 10, 2008, 07:10:24 PM »
It should work fine with other folder names as well. Please PM me your generator URL and I will check it.

Thank you.
Re: I'm only getting one page indexed in my sitemap
« Reply #9 on: May 11, 2008, 11:23:18 AM »
Update: the URLs were blocked in robots.txt file thus none of the pages were included in sitemap. robots.txt shoould be updated in this case.
Re: I'm only getting one page indexed in my sitemap
« Reply #10 on: May 11, 2008, 11:58:24 AM »
Oleg - thank you for your reply.

Sorry I wasn't aware that your crawler would take any notice of the robots.txt file.

If I wanted to enable in the robost.txt file, what should I add?

Thanks once again

Ian
Re: I'm only getting one page indexed in my sitemap
« Reply #11 on: May 11, 2008, 05:25:40 PM »
Sitemap Generator reads robots.txt specifically to avoid further issues - if you will create a sitemap, but will have disallow directive in robots.txt then search engines will not crawl your site. You should remove the "Disallow: /" line from robots file.