How to exclude specific pages from being crawled by the site auditor
Click the settings button at the top right of your site auditor dashboard, and navigate to the "Crawl Settings". Click "edit" to open the crawl settings:
To exclude specific paths or URLs from being crawled by the site auditor, enter the paths to exclude in the "Excluded URLs / Paths" box in the site crawl settings screen. Once you've entered a path, click the "+" button to the right. Once you've added paths to exclude, click the "Save" button.
Paths are excluded using a contains exclusion criteria.
...and so on.
If you wanted to exclude the entire blog and all blog posts from being crawled, you would enter "blog" in the exclusions box.
Keep in mind that any URL which includes the word "blog" will also be excluded in this example.
Let's say that the campaign URL is www.agencyanalytics.com/, and the site hosts listings for products across multiple categories.
Listings are in the format of:
...and so on.
If you wanted to exclude all listing pages from being crawled, you would enter the "listing" in the exclusions box.
If you wanted to exclude all listings in Canada only, you would enter "canada/listing" in the exclusion box.
Note: The exclusions box doesn't currently accept wild cards or regular expressions, but this functionality is on the roadmap and will likely be released at a future date.
Removing path and URL exclusions
You can remove exclusion rules by heading to the Crawl Settings page and clicking the "Remove Icon" (pictured as a garbage can) next to the exclusion rule you want to remove.