Using a wildcard to exclude URLs with numbers
« on: December 16, 2009, 07:24:24 AM »
Hi,

I have a site with over 300,000 pages.

The URLs take the format of:

xxxxx.com/xxx/xxx/xxx.index.html
xxxxx.com/xxx/xxx/xxx.1.html
xxxxx.com/xxx/xxx/xxx.2.html
xxxxx.com/xxx/xxx/yyy/index.html
xxxxx.com/xxx/xxx/yyy/1.html

and so on.

Is there a wildcard option to include the index.html pages but exclude the URLs that have a number in them? These are just further paginated pages, which hold no particular value.
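
To make the rule concrete, here is a rough Python sketch of the filtering I am after. It is only an illustration, not the syntax of any particular tool's wildcard option. The name `is_paginated` and the regular expression are my own assumptions, based on the URL formats listed above: a paginated page is taken to be any URL ending in `/<number>.html` or `.<number>.html`.

```python
import re

# Assumption: paginated pages end in "/<number>.html" or ".<number>.html",
# as in the example URLs above. Numbers elsewhere in the path are ignored.
PAGINATED = re.compile(r"[/.]\d+\.html$")


def is_paginated(url: str) -> bool:
    """Return True if the URL looks like a numbered (paginated) page."""
    return PAGINATED.search(url) is not None


# The example URLs from the post.
urls = [
    "xxxxx.com/xxx/xxx/xxx.index.html",
    "xxxxx.com/xxx/xxx/xxx.1.html",
    "xxxxx.com/xxx/xxx/xxx.2.html",
    "xxxxx.com/xxx/xxx/yyy/index.html",
    "xxxxx.com/xxx/xxx/yyy/1.html",
]

# Keep the index pages, drop the numbered ones.
kept = [u for u in urls if not is_paginated(u)]
print(kept)
```

Anchoring the pattern at the end of the URL means a number in a directory name would not cause a page to be excluded, only a number in the final filename.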