What happens when a denied page (robots) is still in sitemap.xml? - SEO - PhpOut

Valentin
June 29, 2017
272 views
1 vote
2 Answers

I want to prevent a page from being indexed, along with its assets (images).

So if I tell crawlers to skip that page, but that page is still registered in sitemap.xml, will any information on that page be indexed?

Tags: robots.txt seo sitemap sitemap.xml web-crawler

Answers

- unor
- June 29, 2017 at 3:28 pm
- 0 votes
0
robots.txt disallows crawling, not indexing.

If you disallow crawling of a URL in your robots.txt, and you list this URL in your sitemap, it is still disallowed to be crawled. Occurrence in a sitemap doesn’t change this.

This URL might still be indexed, though (whether it’s in the sitemap or not).

Login or Signup to reply.

- JulienNioche
- June 30, 2017 at 9:55 am
- 0 votes
0
Just to add to the previous answer, you can use the Noindex directive in your robots.txt file. It is not part of the standard AFAIK but is commonly used, see blog – although there seem to be diverging opinions about it. Alternatively, you could use the robots meta tags in your webpages.

As usual, there is no guarantee that all crawlers will respect the robots directives, however the main ones will.

Login or Signup to reply.

Please signup or login to give your own answer.

Click here to cancel reply.