I've got a website. After every article there is a big (7 sentences) preview for another article.
The problem is that google index this preview, and it makes my content duplicate.
How can I tell google not to read those previews?
I've got a website. After every article there is a big (7 sentences) preview for another article.
The problem is that google index this preview, and it makes my content duplicate.
How can I tell google not to read those previews?
You can do this by putting the preview section in a different span that would be hidden by Css properties and displayed using javascript at the time of page loading.Since the section is initially not displayed thus it would not be read by the search engine.
Dummy Html code
<span class="pgPreview">The preview for the next pages goes here </span>
Css code to hide it
.pgPreview {
display:none;
}
Javascript code to show it.
<script type="text/javascript">
$(document).ready(function()
{
$(".pgPreview").show();
}
</script>
Assuming that you have the jquery library included.
One effective though less-than-ideal solution could be to load the random article preview with Javascript. It's similar to how some blogs have a "click to load comments" button - they don't want their blog being associated in Google searches with everything their commenters say.
Since you're keeping the primary content of each page the same, I don't think this would be a cloaking issue, but no guarantees.
I know there used to be a system of labels such as:
<!-- googleon/googleoff: <index|anchor|link|all> -->
...but I do not know if that still works on these days...
also, I think it was only valid if you were using google custom search
.
(do not really know for sure)
In any case, I can think in two ways to do what you need:
lolum dolimus situs amen amdu etr --> normal content nulusan ar sindh ew _____________________ | lolum dolimus | | situs amen amdu etr | --> iframe blocked with | nulusan ar sindh ew | robots.txt file --------------------- _____________________ | lolum dolimus | image with text | situs amen amdu etr | --> unreadable to | nulusan ar sindh ew | search engines ---------------------
Of course, both options are not completely infallible and depends solely on whether the search engine follow the directives in standardization and/or the ability to have OCR recognition.