0

I have to setup a cluster, on my computer using 5 virtual machines with hadoop. The configuration requires a port number. Can someone enlighten me on this.I am a beginner in it

DB cooper
  • 115
  • 3
  • 11

2 Answers2

3

If your primary objective is to learn Hadoop then it does not matter you learn it on Windows or Linux, because everything is exactly same on both platforms. I have extensively used Hadoop on both platform and found all the commands and processing are identical on Windows and Linux. So here are my suggestions:

  1. Download VMware VMPlayer on your Windows/Linux Machine
  2. Download CDH Virtual Machine for VMware

    https://ccp.cloudera.com/display/SUPPORT/Downloads

Access virtual machine in your Windows/Linux box and follow the tutorials exactly they are on Linux.

Same info is shared here:

Hadoop on Windows

Community
  • 1
  • 1
AvkashChauhan
  • 20,495
  • 3
  • 34
  • 65
  • I have to configure the hadoop. The nodes need a port number, so can you help me regarding that what port number to give – DB cooper May 11 '12 at 23:10
  • 1
    The default Hadoop is set to run with port 50030 and 50070 so you don't need to play with any Port setting. Either you don't examplain what your actual requirements are or you dont know what you are doing. The best option is to download the cloudera VM and then learn how it works and then try to run your own cluster. – AvkashChauhan May 12 '12 at 15:37
0

Its upto you to choose the port. Normally people use the default ports provided by hadoop. For the default ports, see this. There will be absolutely no harm if you use those ports (unless you have something else running on any of those ports).

Tejas Patil
  • 6,149
  • 1
  • 23
  • 38