How can I instruct Nutch to treat page#1 as belonging to a core and page#2 as belonging to a different core (both pages from the same domain)?
Practical situation: let's say Nutch is crawling and indexing www.businessweek.com; let's also say that I have one core called "Japan" and another core called "France".
I want the page http://www.businessweek.com/magazine/content/05_51/b3964049.htm to be indexed only for the France core, since it's relevant for France but irrelevant for Japan.
Consequently, I want the page http://www.businessweek.com/magazine/content/11_27/b4235016555525.htm to be indexed only for the Japan core, but not for France.
Assuming we already know how to identify that a certain page belongs to a specific tag... how can Nutch be instructed about that?