• Welcome to Sitemap Generator Forum.
 

Exclude URLs

Started by accounts16, June 18, 2008, 11:09:42 AM


accounts16

Hi,

I don't know what to do. I have a news site with option links such as print and send to friend.
The URLs look like:

[ External links are visible to forum administrators only ]
[ External links are visible to forum administrators only ]
[ External links are visible to forum administrators only ]

So I went for Exclude URLs, but left out the numbers (52, 45 and so on) at the end of the URL.

This does not work.

What should I do to have those URLs excluded?

Should I put * after sit=      (sit=*)  ?

Matt

CallumW

#1
Hi
Yes, but use a '?' instead of '*'

Here's an example of an exclude list I use on a domain.
The SiteGenerator was picking up stuff like:
domain/blog/entry.html
domain/blog/entry.html?review=write

I don't need the review/comment version of each page, so I ignore it by adding this to the exclude list:
/*review=write
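To illustrate the '*' part of that entry, here is a minimal Python sketch. It assumes '*' behaves like a shell-style glob wildcard (any run of characters), which is a stand-in for illustration and not the generator's confirmed matching logic. The paths are the two from the example above with the domain stripped.

```python
from fnmatch import fnmatchcase

# Assumption: '*' matches any run of characters, as in shell globs.
# This models the exclude entry for illustration; it is not the tool's own code.
EXCLUDE = "/*review=write"

paths = [
    "/blog/entry.html",
    "/blog/entry.html?review=write",
]

# Keep only the paths that the exclude entry does not match.
kept = [p for p in paths if not fnmatchcase(p, EXCLUDE)]
print(kept)
```

Under that assumption only the plain entry page is kept, and the review variant is dropped.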

It follows the same protocol as the robots.txt ('?' = wildcard number, '*' = wildcard character)

CallumW

More info on robots.txt:

http://www.robotstxt.org/orig.html

accounts16

Thank you, I'll give it a go and let you know

accounts16

#4
Quote from: CallumW on June 18, 2008, 01:12:29 PM
More info on robots.txt:

http://www.robotstxt.org/orig.html

Thanks for the info.

Q. In Exclude URLs, do I enter the full address (h t t p : // www.... and so on) or the local path, i.e. modules.php?name=News&new_topic=?&pagenum=?

Matt

CallumW

Just the local path as per my exclude example  ;)

accounts16

Quote from: CallumW on June 18, 2008, 03:15:49 PM
Just the local path as per my exclude example  ;)

Hi,

Eh, I am lost.

The URL goes:
h t t p : / / www. domain/ modules.php?name=News&file=friend&op=FriendSend&sid=21378

I removed h t t p : / / www. domain / - did not help.

Would you rewrite this URL the way it should work, please?

Matt

CallumW

#7
/*sid=?

if the sid includes chars, not just numbers then do
/*sid=*


All subpages begin with / after the domain
the '*' means anything to the left of your static text 'sid='
The '?' means any number after (or '*' any character after)
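
The rules above can be sketched in Python as a pattern-to-regex translation. This is only a model of the reading given in this post ('*' = any characters, '?' = any number), and it assumes the pattern is matched from the start of the local path. The function names are made up for the example and are not part of the generator.

```python
import re

def exclude_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate an exclude entry into a regex, following this post's reading."""
    parts = []
    for ch in pattern:
        if ch == "*":
            parts.append(".*")            # '*' = any run of characters
        elif ch == "?":
            parts.append(r"\d+")          # '?' = any number (one or more digits)
        else:
            parts.append(re.escape(ch))   # everything else is static text
    return re.compile("".join(parts))

def is_excluded(local_path: str, pattern: str) -> bool:
    # Assumption: the entry is matched from the start of the local path.
    return exclude_pattern_to_regex(pattern).match(local_path) is not None

# Local path from the URL posted earlier in the thread (domain removed).
path = "/modules.php?name=News&file=friend&op=FriendSend&sid=21378"
print(is_excluded(path, "/*sid=?"))
print(is_excluded("/modules.php?name=News&sid=abc", "/*sid=?"))
print(is_excluded("/modules.php?name=News&sid=abc", "/*sid=*"))
```

In this model the numeric sid is caught by /*sid=?, while a sid containing letters is only caught by /*sid=*.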

;)

accounts16

Thank you, got it  :)

If anyone wants to see the list of those that work on my site, here it is:

/*sid=?
/*sid=*
/*quote=?
/*quote=*
/*pagenum=?
/*pagenum=*
/*ads=?
/*ads=*
/*mod=?
/*mod=*
*/?/*,?
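
For anyone who wants to try a list like this before a crawl, here is a rough Python sketch. It assumes the reading given earlier in the thread ('*' = any characters, '?' = any number, matched from the start of the local path), so it is a model and not the generator's actual code. The sample paths are partly made up for the test, and the last entry (*/?/*,?) is left out because its intent is not clear from the thread.

```python
import re

# The posted exclude entries, minus the final one.
EXCLUDES = [
    "/*sid=?", "/*sid=*", "/*quote=?", "/*quote=*", "/*pagenum=?",
    "/*pagenum=*", "/*ads=?", "/*ads=*", "/*mod=?", "/*mod=*",
]

def to_regex(pattern):
    # Assumed semantics: '*' -> any characters, '?' -> one or more digits.
    return re.compile("".join(
        ".*" if c == "*" else r"\d+" if c == "?" else re.escape(c)
        for c in pattern
    ))

def keep(paths, patterns):
    """Return the paths that no exclude entry matches."""
    compiled = [to_regex(p) for p in patterns]
    return [p for p in paths if not any(rx.match(p) for rx in compiled)]

paths = [
    "/modules.php?name=News&file=friend&op=FriendSend&sid=21378",  # from the thread
    "/modules.php?name=News&new_topic=3&pagenum=2",                # hypothetical values
    "/modules.php?name=News",                                      # hypothetical
]

star_only = [p for p in EXCLUDES if p.endswith("*")]
print(keep(paths, EXCLUDES))
print(keep(paths, star_only))
```

Under these assumptions both calls keep the same paths, which suggests the '?' entries are redundant whenever the matching '*' entry is present. Whether the real tool behaves the same way would need to be checked against an actual crawl.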