• Welcome to Sitemap Generator Forum.
 

There was an error while retrieving the URL specified

Started by financeira, October 02, 2012, 12:57:26 PM

Previous topic - Next topic

financeira

Hello, since i moved my website to a new dedicated server, xml sitemap generator stop working, now i always get the follow message when start crawlling my website:

There was an error while retrieving the URL specified: [ External links are visible to forum administrators only ]
HTTP Code:
HTTP/1.1 403 Forbidden
HTTP headers:
server: nginx
date: Tue, 02 Oct 2012 11:51:30 GMT
content-type: text/html
content-length: 132
connection: close
set-cookie: d4dad6935f632ac35975e3001dc7bbe8=evgd6kq48ehnfl31uc6a5sprg2; path=/
p3p: CP="NOI ADM DEV PSAi COM NAV OUR OTRo STP IND DEM"
x-powered-by: PleskLin
ms-author-via: DAV
vary

I believe this is a server restriction, but since my server is not managed i was hoping someone could point me for a resolution of this problem.

I checked the htaccess and i dont have any restriction there, my website is running joomla and deactived the pluginsthat might be causing this problem like the firewall of joomla, i also disabled the sever firewall and the problem still remains.

Any help are very welcome

Sincerely
Daniel Q.


financeira

Hello, sorry for replying so late but ive been busy this last days.
Regarding your question yes i do have one .htaccess with the follow content:


##
# @version $Id: htaccess.txt 21064 2011-04-03 22:12:19Z dextercowley $
# @package Joomla
# @copyright Copyright (C) 2005 - 2010 Open Source Matters. All rights reserved.
# @license http://www.gnu.org/copyleft/gpl.html GNU/GPL
# Joomla! is Free Software
##


#####################################################
#  READ THIS COMPLETELY IF YOU CHOOSE TO USE THIS FILE
#
# The line just below this section: 'Options +FollowSymLinks' may cause problems
# with some server configurations.  It is required for use of mod_rewrite, but may already
# be set by your server administrator in a way that dissallows changing it in
# your .htaccess file.  If using it causes your server to error out, comment it out (add # to
# beginning of line), reload your site in your browser and test your sef url's.  If they work,
# it has been set by your server administrator and you do not need it set here.
#
#####################################################

##  Can be commented out if causes errors, see notes above.
Options +FollowSymLinks

#
#  mod_rewrite in use

RewriteEngine On

########## Begin - Rewrite rules to block out some common exploits
## If you experience problems on your site block out the operations listed below
## This attempts to block the most common type of exploit `attempts` to Joomla!
#
## Deny access to extension xml files (uncomment out to activate)
#<Files ~ "\.xml$">
#Order allow,deny
#Deny from all
#Satisfy all
#</Files>
## End of deny access to extension xml files
# Block out any script trying to set a mosConfig value through the URL
RewriteCond %{QUERY_STRING} mosConfig_[a-zA-Z_]{1,21}(=|\%3D) [OR]
# Block out any script trying to base64_encode data within the URL
RewriteCond %{QUERY_STRING} base64_encode[^(]*\([^)]*\) [OR]
# Block out any script that includes a <script> tag in URL
RewriteCond %{QUERY_STRING} (<|%3C)([^s]*s)+cript.*(>|%3E) [NC,OR]
# Block out any script trying to set a PHP GLOBALS variable via URL
RewriteCond %{QUERY_STRING} GLOBALS(=|\[|\%[0-9A-Z]{0,2}) [OR]
# Block out any script trying to modify a _REQUEST variable via URL
RewriteCond %{QUERY_STRING} _REQUEST(=|\[|\%[0-9A-Z]{0,2})
# Return 403 Forbidden header and show the content of the root homepage
RewriteRule .* index.php [F]
#
########## End - Rewrite rules to block out some common exploits


########## Begin - Custom redirects
#
# If you need to redirect some pages, or set a canonical non-www to
# www redirect (or vice versa), place that code here. Ensure those
# redirects use the correct RewriteRule syntax and the [R=301,L] flags.
# -- Redirect requests to IP address + requests for non-www
# -- (Canonicalize to WWW.DN.TLD)
# ----------------------------------

#
########## End - Custom redirects


#  Uncomment following line if your webserver's URL
#  is not directly related to physical file paths.
#  Update Your Joomla! Directory (just / for root)

# RewriteBase /


########## Begin - Joomla! core SEF Section
#
RewriteRule .* - [E=HTTP_AUTHORIZATION:%{HTTP:Authorization}]
#
# If the requested path and file is not /index.php and the request
# has not already been internally rewritten to the index.php script
RewriteCond %{REQUEST_URI} !^/index\.php
# and the request is for root, or for an extensionless URL, or the
# requested URL ends with one of the listed extensions
RewriteCond %{REQUEST_URI} (/[^.]*|\.(php|html?|feed|pdf|raw))$ [NC]
# and the requested path and file doesn't directly match a physical file
RewriteCond %{REQUEST_FILENAME} !-f
# and the requested path and file doesn't directly match a physical folder
RewriteCond %{REQUEST_FILENAME} !-d
# internally rewrite the request to the index.php script
RewriteRule .* index.php [L]
#
########## End - Joomla! core SEF Section
RewriteCond %{HTTPS} ^on$
RewriteCond %{REQUEST_URI} ^/robots.txt$
RewriteRule ^(.*)$ /robots_https.txt [L]


Is there anything wrong with my htaccess?

Sincerely
Daniel

XML-Sitemaps Support

.htaccess looks ok. Please check that you don't have any IPs blocked in Joomla (for instance you might have "flood block" option enabled).