How Name node update availability of Data Nodes for HDFS writes in Hadoop

Question

I have 10 Data nodes the replication factor is 3,file size is 150 and the block size is 64. So file will be splits into three blocks B1,B2,B3. So client asks Name Node for the availability of Data nodes for writing B1 block. My question is how many Data nodes will be provided to client by Name node for writing B1 block. And one more question is how the writes for B1,B2,B3 will be happend either parallelly or sequentially

Is there a particular problem you're encountering/trying to solve? This seems like you're just asking for background on how Hadoop works. — flyingmeatball, Jan 25 '17 at 14:44

score 2 · Accepted Answer · answered Jan 25 '17 at 14:49

2

Data will be written to just one datanode by client, rest replication is taken care by the datanodes itself on namenode instruction. Replica placement: while a datanode receives data of the block from the client, the datanode saves the data in a file, which represents the block, and, simultaneously re-sends the data to another datanode, which is supposed to create another replica of the block.
Blocks are copied sequentially

answered Jan 25 '17 at 14:49

piyush pankaj

725
1
12
24

B1,B2,B3 blocks will be stored sequentially or parallelly?I mean first B1 isstored next B2 and then B3? – sidhartha pani Jan 25 '17 at 14:51
Sequentially as i have written in the answer above – piyush pankaj Jan 25 '17 at 14:52

How Name node update availability of Data Nodes for HDFS writes in Hadoop

1 Answers1