Questions tagged [giraph]

Apache Giraph is an iterative graph processing system built for high scalability.

For example, it is currently used at Facebook to analyze the social graph formed by users and their connections.

Giraph originated as the open-source counterpart to Pregel, the graph processing architecture developed at Google and described in this paper.

Both systems are inspired by the Bulk Synchronous Parallel model of distributed computation introduced by Leslie Valiant.

The Bulk Synchronous Parallel (BSP) abstract computer is a bridging model for designing parallel algorithms. It differs from the parallel random-access machine (PRAM) by not taking communication and synchronization for granted. An important part of analyzing a BSP algorithm is quantifying the synchronization and communication it needs.
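
To make the BSP idea concrete, here is a minimal single-process sketch in Python of the superstep loop: local computation, message exchange, then a barrier. It is not Giraph code and does not use the Giraph API. The function name `bsp_max_value` and the max-value propagation task are illustrative assumptions, and the sketch assumes every neighbor id also appears as a key in `graph`.

```python
from collections import defaultdict


def bsp_max_value(graph, values):
    """Propagate the maximum vertex value through a graph in BSP supersteps.

    graph:  dict mapping vertex id -> list of out-neighbor ids
            (assumption: every neighbor id is also a key of graph)
    values: dict mapping vertex id -> initial numeric value
    Returns (final_values, number_of_supersteps_run).
    """
    values = dict(values)          # do not mutate the caller's dict
    inbox = {}                     # messages delivered for the current superstep
    active = set(graph)            # every vertex is active in superstep 0
    superstep = 0

    while active:
        outbox = defaultdict(list)  # messages visible only after the barrier
        for v in active:
            # Local computation: combine own value with received messages.
            new_value = max([values[v]] + inbox.get(v, []))
            changed = new_value > values[v]
            values[v] = new_value
            # Communication: send only in superstep 0 or when the value grew.
            if superstep == 0 or changed:
                for neighbor in graph[v]:
                    outbox[neighbor].append(new_value)
        # Barrier synchronization: all messages are delivered at once,
        # and only vertices that received a message stay active.
        inbox = dict(outbox)
        active = set(inbox)
        superstep += 1

    return values, superstep


if __name__ == "__main__":
    ring = {"a": ["b"], "b": ["c"], "c": ["a"]}
    final_values, steps = bsp_max_value(ring, {"a": 3, "b": 6, "c": 2})
    print(final_values, steps)
```

On the three-vertex ring above, every vertex ends up holding the value 6. Counting the entries placed in `outbox` per superstep, and the number of times the loop runs, gives the communication and synchronization costs that a BSP analysis tries to quantify.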

Giraph adds several features beyond the basic Pregel model, including master computation, sharded aggregators, edge-oriented input, out-of-core computation, and more.

With a steady development cycle and a growing community of users worldwide, Giraph is a natural choice for unleashing the potential of structured datasets at a massive scale.

156 questions
0
votes
1 answer

Apache Giraph on Cloudera VM - ClassNotFoundException

I am new to Hadoop/Giraph and Java. As part of a task, I downloaded Cloudera Quickstart VM and Giraph on top of it. I am using this book named "Practical Graph Analytics with Apache Giraph; Authors: Shaposhnik, Roman, Martella, Claudio, Logothetis,…
user9068137
0
votes
1 answer

Array vertex values for Apache Giraph

I need to process a graph which has arrays (double) for their vertex value rather than a single double. So the input format would be vertex_Id(string)\t vertex_value (array)\t neighbors. Looking at the documentation, it seems that there is no class…
vmontazeri
  • 393
  • 2
  • 20
0
votes
0 answers

Giraph is using incorrect version of guava

I am submitting a giraph job to a hadoop 1.2.1 cluster. The job is failing with the following stack trace: 2017-04-21 20:37:41,082 ERROR org.apache.hadoop.mapred.Child: Error running child : java.lang.NoSuchMethodError:…
cscan
  • 3,684
  • 9
  • 45
  • 83
0
votes
1 answer

number of remote messages Giraph

I am new to Giraph and Hadoop. Is there any way to count the number of local messages and remote messages (transferred over the network) exchanged between vertices of the graph?
imen
  • 1
  • 2
0
votes
1 answer

Install Apache Giraph on a machine with no internet connection

I want to install Apache Giraph 1.1.0 on a RedHat cluster, Hadoop 2.7.1 is already installed and configured on the cluster. In the official tutorial http://giraph.apache.org/quick_start.html, it is recommended to download source code of Giraph and…
0
votes
1 answer

apache giraph: java.lang.IllegalAccessError

I have been trying to run an Apache Giraph job on an Ubuntu server with Hadoop 2.7.3 and Giraph 1.3.0. I have also tried to run the same job on Hadoop 2.6 with lower versions of Giraph (1.2/1.1). But I receive the following error in both the…
0
votes
1 answer

Hadoop-2.7.2: How manage resources

I use a server with 16 cores, 64 GB RAM, and 2.5 TB of disk, and I want to execute a Giraph program. I have installed hadoop-2.7.2 and I don't know how to configure Hadoop to use only part of the server's resources because the server is used by many…
user349
  • 11
  • 4
0
votes
1 answer

Conversion of graph data from Edge Input Format to Vertex Input Format

I am experimenting around with Giraph. To run the algorithms in Giraph I need the graph data to be in Vertex Input Format. Almost all the available Big Data online is in Edge List Format. I wrote a code in Java to convert this Edge List format into…
natcomp256
  • 706
  • 7
  • 13
0
votes
1 answer

Eclipse IDE and Giraph

I would like to use the Eclipse IDE to create a new Giraph program. How can I include Giraph in Eclipse? I'm new to this topic. I used a shell environment for running the Giraph program and a text editor to write the program. But using a Java IDE seems the…
maria
  • 1
  • 1
0
votes
1 answer

Giraph cannot set a slightly larger superstep value?

When I set superstep to 20, it works well. But when I set superstep to 200, it doesn't work. hadoop jar Test-jar-with-dependencies.jar org.apache.giraph.GiraphRunner test.Test -mc test.TestMC -vif…
0
votes
3 answers

Could not format the Namenode in hadoop

While formatting namenode by executing the command " bin/hadoop namenode -format " i get this error: hduser@ubuntu:~/hadoop/bin$ ./hadoop namenode -format DEPRECATED: Use of this script to execute hdfs command is deprecated. Instead use the hdfs…
Y.Asmae
  • 61
  • 6
0
votes
0 answers

running program with giraph

I'm trying to run an example of graph's algorithms but I'm getting an error that the basicComputation is not found , but in my code I imported BasicComputation import org.apache.giraph.Algorithm; import…
Salma Hakim
  • 105
  • 8
0
votes
3 answers

Giraph Job running in local mode always

I ran Giraph 1.1.0 on Hadoop 2.6.0. The mapredsite.xml looks like this mapreduce.framework.name yarn The runtime framework for executing MapReduce jobs. Can be one of …
0
votes
1 answer

Apache Giraph: Cannot modify edge value via setEdgeValue()

This code is borrowed from the book "Practical Graph Analytics with Apache Giraph". My code's goal is to transform a graph from using only one-direction edge into using two-direction edges. My data is a tab-delimited text file to…
Marc Kees
  • 206
  • 2
  • 15
0
votes
1 answer

Giraph application get stuck, on superstep 4, all workers active but without progress

I'm doing a BFS search through the Wikipedia (Spanish edition) site. I converted the dump (https://dumps.wikimedia.org/eswiki/20160601) into a file that could be read with Giraph. The BFS is searching for paths, and it's all OK until it gets stuck in some…
chomp
  • 1,352
  • 13
  • 31