Questions tagged [hama]

Apache Hama is a pure BSP (Bulk Synchronous Parallel) computing framework on top of HDFS (Hadoop Distributed File System) for massive scientific computations such as matrix, graph and network algorithms.

Why Hama and BSP?

Today, many practical data processing applications require a more flexible programming abstraction that can run on highly scalable, massive data systems (e.g., HDFS, HBase). A message-passing paradigm beyond the MapReduce framework would give such applications more flexible communication. The Bulk Synchronous Parallel (BSP) model fills the bill. Some of its significant advantages over MapReduce and MPI are:

  • Supports a message-passing style of application development
  • Provides a small, flexible, simple, and easy-to-use API
  • Can perform better than MPI for communication-intensive applications
  • Guarantees that deadlocks and collisions cannot occur in the communication mechanisms

Source: The Apache Hama Project
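
For readers new to the model, the superstep structure of BSP (local work and message sending, then a barrier, then processing of received messages) can be sketched in plain Java. This is only an illustration of the idea using JDK threads and a `CyclicBarrier`; it does not use Hama's API, and the class and method names (`BspSketch`, `sumPeerIds`) are made up for the example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BrokenBarrierException;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.CyclicBarrier;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical illustration of BSP supersteps; not Hama code.
public class BspSketch {

    // Runs numPeers threads through two supersteps and returns the sum
    // of all peer ids as computed by peer 0.
    public static int sumPeerIds(int numPeers) throws InterruptedException {
        // One inbox per peer; messages sent in a superstep are read in the next one.
        List<ConcurrentLinkedQueue<Integer>> inboxes = new ArrayList<>();
        for (int i = 0; i < numPeers; i++) {
            inboxes.add(new ConcurrentLinkedQueue<>());
        }
        CyclicBarrier barrier = new CyclicBarrier(numPeers);
        AtomicInteger result = new AtomicInteger();

        List<Thread> peers = new ArrayList<>();
        for (int id = 0; id < numPeers; id++) {
            final int peerId = id;
            Thread peer = new Thread(() -> {
                try {
                    // Superstep 0: every peer sends its id to peer 0.
                    inboxes.get(0).add(peerId);
                    // Barrier synchronization: nobody proceeds until all have sent.
                    barrier.await();
                    // Superstep 1: peer 0 consumes the messages it received.
                    if (peerId == 0) {
                        int sum = 0;
                        Integer msg;
                        while ((msg = inboxes.get(0).poll()) != null) {
                            sum += msg;
                        }
                        result.set(sum);
                    }
                } catch (InterruptedException | BrokenBarrierException e) {
                    throw new IllegalStateException(e);
                }
            });
            peers.add(peer);
            peer.start();
        }
        for (Thread peer : peers) {
            peer.join();
        }
        return result.get();
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(sumPeerIds(4)); // ids 0 + 1 + 2 + 3
    }
}
```

Because each peer only sends before the barrier and only reads after it, no peer ever waits on another peer's message inside a superstep, which is the intuition behind the deadlock-freedom claim above.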

22 questions
5
votes
1 answer

Apache Hama on Amazon Elastic MapReduce

I am trying to run Apache Hama on Amazon Elastic MapReduce using the https://github.com/awslabs/emr-bootstrap-actions/tree/master/hama script. However, when trying it out with one master node and two slave nodes, peer.getNumPeers() in the BSP code reports…
Madhura Jayaratne
4
votes
0 answers

Scalability of Apache Hama cluster

I am programming for Apache Hama using the Hama graph API. I have a scalability problem when running my program in a cluster. When I increase the number of machines in the cluster, I expect a shorter execution time, but what I…
2
votes
1 answer

Apache Hama on EC2 through Whirr 0.8

I am trying to take a look at Apache Hama. I followed the instructions at http://hama.apache.org/hama_on_clouds.html and downloaded Whirr 0.8.2, but this version has no hama-ec2.properties in the recipes folder, so I cannot try…
giulatona
2
votes
1 answer

Is Apache Hama suitable for building a decision tree?

I have currently implemented Google's framework for building decision trees (also known as PLANET) in Hadoop. It starts with a single vertex, and with MapReduce jobs you add more and more until the tree is fully built. One major problem though is…
jojoba
1
vote
0 answers

Not able to run Apache Hama examples

I installed single-node HDFS and local-mode Apache Hama 0.7.1, and the installations seemed to be successful. But I am not able to test it by running any of the provided examples, such as PiEstimator: $ $HAMA_HOME/bin/hama jar…
1
vote
0 answers

Internal working of Spark - Communication/Synchronization

I am quite new to Spark but already have programming experience with the BSP model. In the BSP model (e.g. Apache Hama), we have to handle all the communication and synchronization of nodes on our own. This is good on one hand because we have finer control…
Behroz Sikander
1
vote
0 answers

Implement breadth-first search using Hama

I've done some research, and I seem to be missing one small part. I understand how a breadth-first search works, but I don't understand how to partition the nodes so that it can be computed in parallel using Hama. Is there any method to do it?
Hari K
1
vote
0 answers

Breadth First Search in Hama

I am doing a project in Apache Hama to implement breadth-first search and am having trouble partitioning the input graph. Can anybody suggest a method to do this? public static class MinIntCombiner extends Combiner
1
vote
2 answers

Apache Hama installation error on Ubuntu 14.04

I want to install and run Hama using the Hama installation guide, which uses port 54310 throughout the process, but when I try to run an example on Hama it gives me an error, i.e. retrying to connect to server: localhost/ 127.0.0.1:40000 here…
hanif s
1
vote
2 answers

What are the main differences between MapReduce and Apache Hama?

Hi, I am finding it difficult to compare MapReduce with Hama. I understand that Hama uses the bulk synchronous parallel model and that the worker nodes can communicate with one another, whereas in Apache Hadoop the worker nodes only communicate to…
bdavies6086
1
vote
1 answer

Where can I get Apache Hama 0.6.2 or an older version of Hama?

Can anyone provide a link to Apache Hama 0.6.2 or an earlier release? I can find only 0.6.3 on the official site, but I need the earlier versions.
Debashisenator
1
vote
1 answer

Runtime partition failed for this job in Hama BSP

I encountered the following problem when starting a Hama BSP job. This exception occurs when Hama tries to load and partition the input data before it actually runs my own code. This is a known problem discussed on some websites but…
keelar
1
vote
1 answer

Exception in Hama BSP

Hama throws the following exception during the input data partitioning phase, before actually running my BSP job. What are the possible root causes of this exception? Any suggestions on how to find the root cause are appreciated. …
keelar
1
vote
1 answer

Does Apache Hama, like Hadoop MapReduce, have a monitoring web page?

Hadoop MapReduce's JobTracker has a monitoring web page on port 50030, but when I tried to open a browser on my Hama BSPMaster node at its service port (which I set to 40000), it seemed that Hama does not have such a monitoring page. The website…
keelar
1
vote
0 answers

Unable to use (install) Apache Hama on Ubuntu 12.04 LTS

This is the configuration file I…
Galaxy