yoc

*
  • *
  • 18
Txt files not broken down
« on: June 10, 2019, 06:06:44 PM »
After generating over 300k links and building sitemaps, the urllist.txt map remains one big file. With the 50k-URL limit set by search engines, this is a problem. The ROR file is also not split. Is there a solution?
Re: Txt files not broken down
« Reply #1 on: June 11, 2019, 04:41:25 AM »
Hello,

The 50K limit applies to the XML sitemap only.

yoc

Re: Txt files not broken down
« Reply #2 on: June 11, 2019, 05:57:36 PM »
FROM Google Webmaster Tools:

Sitemaps
/urllist.txt

Last read
6/11/19
Discovered URLs
50,000

Sitemap can be read, but has errors
Too many URLs
1 instance
Your Sitemap contains too many URLs. Please create multiple Sitemaps with up to 50000 URLs each and submit all Sitemaps.
Examples
Line 50001

yoc

Re: Txt files not broken down
« Reply #3 on: June 11, 2019, 05:59:41 PM »
FROM THE WEB:

According to the source: A sitemap file can't contain more than 50,000 URLs and must be no larger than 10 MB uncompressed. If your Sitemap is larger than this, break it into several smaller Sitemaps. These limits help ensure that your web server is not overloaded by serving large files to Google.

yoc

Re: Txt files not broken down
« Reply #4 on: June 11, 2019, 06:01:14 PM »
FROM Google:
All formats limit a single sitemap to 50MB (uncompressed) and 50,000 URLs. If you have a larger file or more URLs, you will have to break your list into multiple sitemaps. You can optionally create a sitemap index file (a file that points to a list of sitemaps) and submit that single index file to Google. You can submit multiple sitemaps and/or sitemap index files to Google.
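For anyone who wants to try the sitemap index route described in the quote, here is a minimal sketch of building one with Python's standard library. The element names and namespace come from the sitemaps.org protocol; the function name `build_sitemap_index` and the example.com URLs are placeholders I made up for illustration, and this is not part of the generator script.

```python
from xml.etree import ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap_index(sitemap_urls):
    """Return a sitemap index document (as a string) listing the given sitemap URLs."""
    root = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for url in sitemap_urls:
        entry = ET.SubElement(root, "sitemap")
        ET.SubElement(entry, "loc").text = url
    return ET.tostring(root, encoding="unicode")


# Hypothetical sitemap files on a placeholder domain.
index_xml = build_sitemap_index([
    "https://www.example.com/sitemap1.xml",
    "https://www.example.com/sitemap2.xml",
])
print(index_xml)
```

The output has no XML declaration line; prepend one when saving the file if your tooling expects it.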
Re: Txt files not broken down
« Reply #5 on: June 13, 2019, 02:47:42 PM »
Hello,

You only need to submit the XML sitemap to Google, not the text one (it contains the same list of links).

yoc

Re: Txt files not broken down
« Reply #6 on: June 13, 2019, 06:20:22 PM »
I know I only NEED to send Google the .xml file, but Google crawls the .txt files way better. Google has already crawled and indexed the .txt files (I had to split them manually) far more aggressively than the .xml files (after many years). I've been doing this since 1996.

Change is critical and the internet changes all the time....
Re: Txt files not broken down
« Reply #7 on: June 13, 2019, 08:11:07 PM »
Hello,

Splitting the text sitemap is not currently supported by the sitemap generator script.
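Until that is supported, the manual split mentioned earlier in the thread can be scripted. Below is a rough sketch in Python, assuming urllist.txt holds one URL per line. The function names and the `urllist1.txt`, `urllist2.txt` naming scheme are my own invention, not something the generator produces, and the sketch only enforces the 50,000-URL count, not the file-size limit.

```python
from pathlib import Path

MAX_URLS = 50_000  # per-file URL limit quoted earlier in this thread


def chunk_urls(urls, max_urls=MAX_URLS):
    """Yield successive lists of at most max_urls URLs."""
    chunk = []
    for url in urls:
        chunk.append(url)
        if len(chunk) == max_urls:
            yield chunk
            chunk = []
    if chunk:  # leftover URLs that did not fill a whole chunk
        yield chunk


def split_urllist(source, out_dir, max_urls=MAX_URLS):
    """Split a one-URL-per-line text file into numbered parts; return the paths written."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with open(source, encoding="utf-8") as handle:
        # Skip blank lines and strip trailing newlines.
        urls = (line.strip() for line in handle if line.strip())
        for number, chunk in enumerate(chunk_urls(urls, max_urls), start=1):
            part = out_dir / f"urllist{number}.txt"  # hypothetical naming scheme
            part.write_text("\n".join(chunk) + "\n", encoding="utf-8")
            written.append(part)
    return written
```

Calling `split_urllist("urllist.txt", "split")` would write the parts into a `split` folder; each part then has to be uploaded and submitted separately.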