I want to have separate data nodes for different projects(don't want to share data nodes between two project's)
I can see option in Cloudera Manger to have two different clusters using shared cloudera manager. So can i have separate cluster of just data nodes and share HDFS and Yarn managers with earlier cluster. Yarn/HDFS master services will have to have two separate fsimage's/edit logs and Resource Manger i guess(Or what ever server's configuration I will have on My master nodes those will be shared with new cluster data nodes and client process installed on it.)
Is it possible? Has anyone done that before. hows performance ? I am referring this document -> [http://www.cloudera.com/documentation/archive/manager/4-x/4-5-1/Cloudera-Manager-Enterprise-Edition-User-Guide/cmeeug_topic_6.html][ cloudera documentation for multi cluster using one CM]
Also can we set some rule for HDFS to store/use particular set of data nodes for particular set of data/directory only so that separation can be achived ?
Thanks in advance.