Questions tagged [accumulo]

The Apache Accumulo sorted, distributed key/value store is based on Google's BigTable design. It is built on top of Apache Hadoop, ZooKeeper, and Thrift. It features a few novel improvements on the BigTable design in the form of cell-level access labels and a server-side programming mechanism that can modify key/value pairs at various points in the data management process.

The Apache Accumulo sorted, distributed key/value store is a robust, scalable, high performance data storage and retrieval system. Apache Accumulo is based on Google's BigTable design and is built on top of Apache Hadoop, ZooKeeper, and Thrift. Apache Accumulo features a few novel improvements on the BigTable design in the form of cell-based access control and a server-side programming mechanism that can modify key/value pairs at various points in the data management process. Other notable improvements and feature are outlined here.

Google published the design of BigTable in 2006. Several other open source projects have implemented aspects of this design including HBase, Hypertable, and Cassandra. Accumulo began its development in 2008 and joined the Apache community in 2011.

Official Website: https://accumulo.apache.org/

268 questions
3
votes
1 answer

Geomesa export returns no results despite data being present

Was wondering if you could please help us understand an issue when querying geomesa/Accumulo? Issue: When we query for all fields with geomesa export -u A -p B -c catalog1 -f d1-json -F csv it returns data (see below for sample). When we add a…
mds91
  • 217
  • 2
  • 9
3
votes
1 answer

Failed to execute goal while using Maven to install Accumulo

I am trying to install Accumulo. But I am running into an issue. Here is the command that I am using: mvn package install -X Here is the last part of the execution of the installation where you can clearly see the error: [INFO]…
jnbdz
  • 4,863
  • 9
  • 51
  • 93
3
votes
1 answer

Accumulo MapReduce job fails with java.io.EOFException, using AccumuloRowInputFormat

All my mappers fail with the exception below. I've only shown the last failure for conciseness. Why would this be happening and how do I fix it? 16/09/21 17:01:57 INFO mapred.JobClient: Task Id : attempt_201609151451_0044_m_000002_2, Status :…
Mack
  • 2,614
  • 2
  • 21
  • 33
3
votes
2 answers

odd error when populating accumulo 1.6 mutation object via spark-notebook

using spark-notebook to update an accumulo table. employing the method specified in both the accumulo documentation and the accumulo example code. Below is verbatim what I put into notebook, and the responses: val clientRqrdTble = new…
snerd
  • 1,238
  • 1
  • 14
  • 28
3
votes
2 answers

Accumulo getting stucked and not starting

I've been trying to install Accumulo and try it for a few days but it gets stocked before even starting. I ended up using HortonWorks Sandbox that comes with Hadoop and Zookeeper installed. I followed the instruciton on Accmulo setup page and…
reza
  • 1,188
  • 3
  • 17
  • 32
3
votes
1 answer

Accumulo initialization exception in VirtualBox CentOS 6.4

I am trying to launch Accumulo, and I am running into a nasty exception that seems to be tied to some network problem with my Virtual Machine. I thought the command sudo -u accumulo /usr/lib/accumulo/bin/accumulo init --instance-name lumify…
3
votes
0 answers

Accumulo Java API Connection/Session

I wan wondering if anyone knows what exactly happens when a connection is made from JAVA API or thrift to Accumulo. Client connection is made to ZooKeepers only? My understanding is that after a client is authenticated with ZooKeeper, it directly…
user3293898
  • 271
  • 1
  • 3
  • 8
3
votes
2 answers

How do you use "Range" to Scan an entire table in accumulo

How do you use "Range" to Scan an entire table in accumulo without apriori knowledge? How do folks currently do that. I want to take the random search from:I am looking at writing an Accumulo iterator to return a random sample of a percentile of a…
Chris Rigano
  • 687
  • 1
  • 11
  • 23
3
votes
1 answer

What CAP-Type does Apache Accumulo have?

During the evaluation of several distributed systems I came across the CAP-Theorem. Unfortunately I can't find a classification for Apache Accumulo. Here is the actual question: What kind of cap-type is Apache Accumulo and why?
Matthias Kricke
  • 4,931
  • 4
  • 29
  • 43
3
votes
2 answers

Connecting to Accumulo from NodeJS

I have been trying to connect to Accumulo from NodeJS through the Thrift proxy, but have been unsuccessful. var thrift = require("thrift"); var AccumuloClient = require("./AccumuloProxy"); var transport = thrift.TFramedTransport; var protocol =…
ryknow
  • 161
  • 1
  • 10
3
votes
2 answers

Accumulo createBatchScanner range not working as expected

I cant get a batch scanner to only scan for a specific row, when settings start and stop keys to the same thing I get no entry's back, when using an scanner I get this exception: "java.lang.IllegalArgumentException: Start key must be less than end…
loliman
  • 33
  • 1
  • 3
3
votes
0 answers

accumulo init failed to connect to zookeeper

when i initial the accumulo with the command: accumulo init, error appears, cannot connect to the zookeeper,however the zookeeper is running: INFO:zookeeper is localhost:2181 Thread "init" died null java.lang.reflect.InvocationTargetException at…
myfun
  • 31
  • 2
2
votes
1 answer

Accumulo cluster deployment in Kubernetes

I am trying to use the container from https://github.com/cybermaggedon/accumulo-docker to create a 3 node deployment in the Google Kubernetes Engine. My main problem is how to make the nodes aware of each other. For example, the accumulo/conf/slaves…
user2650994
  • 136
  • 5
2
votes
1 answer

Flink batching Sink

I'm trying to use flink in both a streaming and batch way, to add a lot of data into Accumulo (A few million a minute). I want to batch up records before sending them to Accumulo. I ingest data either from a directory or via kafka, convert the data…
zargarf
  • 633
  • 6
  • 18
2
votes
0 answers

Accumulo on Amazon S3

According to Amazon EMR documentation, HBase can use Amazon S3 as data storage. I am using EMR to run apache accumulo database, is possible to store data in a S3 bucket?
jramirez
  • 155
  • 9
1
2
3
17 18