We have an Umbraco site in a load balanced environment and we need to make sure only the actual URL gets crawled and not the different production URLs.
We only want example.com
to be indexed while load balancers at production1.example.com
and production2.example.com
are not.
Do I add a disallow rule for those URLs to the robots.txt, or add a meta nofollow
tag to the head? Or is there another way to have the load balancing URLs not indexed by crawlers?