XML Sitemaps Generator

Author Topic: Error Warning: preg_match() [function.preg-match]: Compilation failed + help  (Read 10031 times)

jb92500

  • Registered Customer
  • Approved member
  • *
  • Posts: 6
Hello I have few problems. Since 10 days now my sitemap isn't working. I noticed only now because I bought the video sitemap option. When I start the crawling the following error occur :

Warning: preg_match() [function.preg-match]: Compilation failed: regular expression is too large at offset 35939 in /home/realityg/public_html/generator/pages/class.grab.inc.php on line 107

Warning: preg_match() [function.preg-match]: Compilation failed: regular expression is too large at offset 35939 in /home/realityg/public_html/generator/pages/class.grab.inc.php on line 192

It's saying the second error unlimited amount of times.

So I need your help plz.

- My second problem was, is there a way that all the link that haven't changed are not crawled each time, because now my crawling is 7hours long.

Thanks for the help :)

XML-Sitemaps Support

  • Administrator
  • Hero Member
  • *****
  • Posts: 10621
Hello,

did you modify "Exclude URLs" or "Do not parse" settings in generator config? If yes, what are the values you did set?


> - My second problem was, is there a way that all the link that haven't changed are not crawled each time, because now my crawling is 7hours long.

Unfortunately, generator will have no way to find if pages were actually changed unless they crawl them, so it only could work if you know the URLs (or parts of URLs) for those pages and add them in "Do not parse" setting in generator configuration.
Oleg Ignatiuk
www.xml-sitemaps.com
Send me a Private Message

For maximum exposure and traffic for your web site check out our additional SEO Services.

jb92500

  • Registered Customer
  • Approved member
  • *
  • Posts: 6
[external links are visible to admins only]
Hello,

did you modify "Exclude URLs" or "Do not parse" settings in generator config? If yes, what are the values you did set?


> - My second problem was, is there a way that all the link that haven't changed are not crawled each time, because now my crawling is 7hours long.

Unfortunately, generator will have no way to find if pages were actually changed unless they crawl them, so it only could work if you know the URLs (or parts of URLs) for those pages and add them in "Do not parse" setting in generator configuration.

Thanks for your awnser. In generator.conf I changed <option name="xs_robotstxt"> to 0 (it was set on 1).

And is there a way that I don't generate my URLs anymore, but I only generate the video sitemap ?

XML-Sitemaps Support

  • Administrator
  • Hero Member
  • *****
  • Posts: 10621
Hello,

generator still needds to crawl your website to *find* pages with videos. In case if they are in a separate section of your site, for instance in /video/ subfolder, you can use it as Starting URL and other pages will not get crawled.
Oleg Ignatiuk
www.xml-sitemaps.com
Send me a Private Message

For maximum exposure and traffic for your web site check out our additional SEO Services.

jb92500

  • Registered Customer
  • Approved member
  • *
  • Posts: 6
I see thanks for your awnser.

One last question, the crawling process was fined and like 280 videos were found on my site.

BUT (it's a forum), these 280 videso were in the media part of my site, where only videos are psoted, and all the video embeded on my threads haven't been found.

Why the crawling process didn't found the video I have on my topics ?

Tx.

XML-Sitemaps Support

  • Administrator
  • Hero Member
  • *****
  • Posts: 10621
What is an example URL of the page with embedded video that was not included?
Oleg Ignatiuk
www.xml-sitemaps.com
Send me a Private Message

For maximum exposure and traffic for your web site check out our additional SEO Services.

jb92500

  • Registered Customer
  • Approved member
  • *
  • Posts: 6
mysite.com/media = embeded
mysite.com/threads/name of the thread = not embeded (videos I mean).


XML-Sitemaps Support

  • Administrator
  • Hero Member
  • *****
  • Posts: 10621
I need specific example URLs that I can open in browser to check the issue.
Oleg Ignatiuk
www.xml-sitemaps.com
Send me a Private Message

For maximum exposure and traffic for your web site check out our additional SEO Services.

 

SMF 2.0.12 | SMF © 2014, Simple Machines
XHTML RSS WAP2