There was an error while retrieving the URL specified
« on: October 02, 2012, 12:57:26 PM »
Hello, since i moved my website to a new dedicated server, xml sitemap generator stop working, now i always get the follow message when start crawlling my website:

There was an error while retrieving the URL specified: [ External links are visible to forum administrators only ]
HTTP Code:
HTTP/1.1 403 Forbidden
HTTP headers:
server: nginx
date: Tue, 02 Oct 2012 11:51:30 GMT
content-type: text/html
content-length: 132
connection: close
set-cookie: d4dad6935f632ac35975e3001dc7bbe8=evgd6kq48ehnfl31uc6a5sprg2; path=/
p3p: CP="NOI ADM DEV PSAi COM NAV OUR OTRo STP IND DEM"
x-powered-by: PleskLin
ms-author-via: DAV
vary

I believe this is a server restriction, but since my server is not managed i was hoping someone could point me for a resolution of this problem.

I checked the htaccess and i dont have any restriction there, my website is running joomla and deactived the pluginsthat might be causing this problem like the firewall of joomla, i also disabled the sever firewall and the problem still remains.

Any help are very welcome

Sincerely
Daniel Q.
Re: There was an error while retrieving the URL specified
« Reply #1 on: October 04, 2012, 12:21:00 AM »
Hello,

do you have .htaccess file in your domain root folder? It might block requests from generator.
Re: There was an error while retrieving the URL specified
« Reply #2 on: October 10, 2012, 10:34:19 AM »
Hello, sorry for replying so late but ive been busy this last days.
Regarding your question yes i do have one .htaccess with the follow content:


Code: [Select]
##
# @version $Id: htaccess.txt 21064 2011-04-03 22:12:19Z dextercowley $
# @package Joomla
# @copyright Copyright (C) 2005 - 2010 Open Source Matters. All rights reserved.
# @license http://www.gnu.org/copyleft/gpl.html GNU/GPL
# Joomla! is Free Software
##


#####################################################
#  READ THIS COMPLETELY IF YOU CHOOSE TO USE THIS FILE
#
# The line just below this section: 'Options +FollowSymLinks' may cause problems
# with some server configurations.  It is required for use of mod_rewrite, but may already
# be set by your server administrator in a way that dissallows changing it in
# your .htaccess file.  If using it causes your server to error out, comment it out (add # to
# beginning of line), reload your site in your browser and test your sef url's.  If they work,
# it has been set by your server administrator and you do not need it set here.
#
#####################################################

##  Can be commented out if causes errors, see notes above.
Options +FollowSymLinks

#
#  mod_rewrite in use

RewriteEngine On

########## Begin - Rewrite rules to block out some common exploits
## If you experience problems on your site block out the operations listed below
## This attempts to block the most common type of exploit `attempts` to Joomla!
#
## Deny access to extension xml files (uncomment out to activate)
#<Files ~ "\.xml$">
#Order allow,deny
#Deny from all
#Satisfy all
#</Files>
## End of deny access to extension xml files
# Block out any script trying to set a mosConfig value through the URL
RewriteCond %{QUERY_STRING} mosConfig_[a-zA-Z_]{1,21}(=|\%3D) [OR]
# Block out any script trying to base64_encode data within the URL
RewriteCond %{QUERY_STRING} base64_encode[^(]*\([^)]*\) [OR]
# Block out any script that includes a <script> tag in URL
RewriteCond %{QUERY_STRING} (<|%3C)([^s]*s)+cript.*(>|%3E) [NC,OR]
# Block out any script trying to set a PHP GLOBALS variable via URL
RewriteCond %{QUERY_STRING} GLOBALS(=|\[|\%[0-9A-Z]{0,2}) [OR]
# Block out any script trying to modify a _REQUEST variable via URL
RewriteCond %{QUERY_STRING} _REQUEST(=|\[|\%[0-9A-Z]{0,2})
# Return 403 Forbidden header and show the content of the root homepage
RewriteRule .* index.php [F]
#
########## End - Rewrite rules to block out some common exploits


########## Begin - Custom redirects
#
# If you need to redirect some pages, or set a canonical non-www to
# www redirect (or vice versa), place that code here. Ensure those
# redirects use the correct RewriteRule syntax and the [R=301,L] flags.
# -- Redirect requests to IP address + requests for non-www
# -- (Canonicalize to WWW.DN.TLD)
# ----------------------------------

#
########## End - Custom redirects


#  Uncomment following line if your webserver's URL
#  is not directly related to physical file paths.
#  Update Your Joomla! Directory (just / for root)

# RewriteBase /


########## Begin - Joomla! core SEF Section
#
RewriteRule .* - [E=HTTP_AUTHORIZATION:%{HTTP:Authorization}]
#
# If the requested path and file is not /index.php and the request
# has not already been internally rewritten to the index.php script
RewriteCond %{REQUEST_URI} !^/index\.php
# and the request is for root, or for an extensionless URL, or the
# requested URL ends with one of the listed extensions
RewriteCond %{REQUEST_URI} (/[^.]*|\.(php|html?|feed|pdf|raw))$ [NC]
# and the requested path and file doesn't directly match a physical file
RewriteCond %{REQUEST_FILENAME} !-f
# and the requested path and file doesn't directly match a physical folder
RewriteCond %{REQUEST_FILENAME} !-d
# internally rewrite the request to the index.php script
RewriteRule .* index.php [L]
#
########## End - Joomla! core SEF Section
RewriteCond %{HTTPS} ^on$
RewriteCond %{REQUEST_URI} ^/robots.txt$
RewriteRule ^(.*)$ /robots_https.txt [L]

Is there anything wrong with my htaccess?

Sincerely
Daniel
Re: There was an error while retrieving the URL specified
« Reply #3 on: October 10, 2012, 10:49:12 PM »
.htaccess looks ok. Please check that you don't have any IPs blocked in Joomla (for instance you might have "flood block" option enabled).