Questions tagged [accumulo]

The Apache Accumulo sorted, distributed key/value store is based on Google's BigTable design. It is built on top of Apache Hadoop, ZooKeeper, and Thrift. It features a few novel improvements on the BigTable design in the form of cell-level access labels and a server-side programming mechanism that can modify key/value pairs at various points in the data management process.

The Apache Accumulo sorted, distributed key/value store is a robust, scalable, high-performance data storage and retrieval system. Apache Accumulo is based on Google's BigTable design and is built on top of Apache Hadoop, ZooKeeper, and Thrift. Apache Accumulo features a few novel improvements on the BigTable design in the form of cell-based access control and a server-side programming mechanism that can modify key/value pairs at various points in the data management process. Other notable improvements and features are outlined in the project documentation.

Google published the design of BigTable in 2006. Several other open-source projects have implemented aspects of this design, including HBase, Hypertable, and Cassandra. Accumulo began development in 2008 and joined the Apache community in 2011.

Official Website: https://accumulo.apache.org/

268 questions

0 answers

Accumulo custom Filter is not getting called correctly on scan

I have written a custom Filter which is pretty much just a ripoff of the provided AgeOffFilter with some extra stuff. My goal is to remove entries from Accumulo based on a custom time field. public class CustomAgeOffFilter extends Filter { It has…

asked Oct 11 '17 at 13:21

Smittey

1 answer

Multiple table input for mapreduce

I am thinking of running a MapReduce job using Accumulo tables as input. Is there a way to have two different tables as input, the same way multiple file inputs can be added with addInputPath? Or is it possible to have one input from a file and the…

hadoop mapreduce accumulo

asked Jun 26 '17 at 08:41

user3794667384

2 answers

Spark streaming + Accumulo - Serialize BatchWriterImpl

I'm looking for Spark Streaming + Accumulo connectors and a full usage example. Currently I'm trying to write Spark Streaming results to an Accumulo table, but I'm getting a NotSerializableException for BatchWriter. Can someone point me to examples of how…

apache-spark spark-streaming accumulo

asked May 08 '17 at 07:55

Marcin

2 answers

Using GeoMesa Native API to insert data in accumulo

I am trying to insert and read data from Accumulo storage using the GeoMesa Native API. I have created a class file to use GeoMesa Accumulo storage natively. Here is my Java code: package org.locationtech.geomesa.api; import…

java accumulo geomesa

asked Apr 01 '17 at 10:19

Suresh Prajapati

1 answer

GeoMesa - Differences between the supported Data Stores?

I am skimming through the documentation of GeoMesa and the table of contents exposes a number of supported data stores that can be used: Specific back-end implementations are described in the following chapters: Accumulo Data Store Kafka…

geoserver accumulo geomesa

asked Mar 15 '17 at 10:13

j9dy

1 answer

GeoMesa export query error

Problem: When I try to run the query below, it gives an error. Is there something missing in the CQL query, or is a jar file missing? Also, is it just a GeoMesa error or an Accumulo error? Is there any other way to query GeoMesa and get results in a CSV…

cql accumulo geomesa

asked Mar 14 '17 at 11:33

kaxil

1 answer

How to query a geomesa-accumulo feature using command line

I ingested data into GeoMesa Accumulo using SFTs and converters. The data was ingested successfully and I can visualise it using the GeoServer plugin. I want to filter feature data using the command line, but I am not able to find any commands to do…

cql accumulo geomesa

asked Mar 06 '17 at 12:13

Suresh Prajapati

1 answer

Start-all.sh for Accumulo throws multiple warnings about: Max open files on localhost

I am getting multiple warnings about: WARN : Max open files on localhost is 1024, recommend 32768 I am unsure what this is about. /usr/local/accumulo$ ./bin/start-all.sh & [1] 27142 /usr/local/accumulo$ Starting monitor on localhost WARN : Max open…

ubuntu hadoop networking apache-zookeeper accumulo

asked Dec 01 '16 at 15:28

jnbdz

1 answer

Iterating over accumulo table with scala

I have a table in Accumulo named records with a few families and qualifiers per row_id. It looks like this in the Accumulo shell: michaelp@accumulo records> scan 2016-10-17 16:27:55,359 [Shell.audit] INFO : michaelp@accumulo records> scan E001…

scala accumulo

asked Oct 17 '16 at 20:42

Mike

3 answers

How do you debug accumulo iterators and filters?

I am using logging for debugging, however it's extremely inefficient. What other approaches are there?

accumulo

asked Aug 04 '16 at 12:50

kgu87

1 answer

Read from Accumulo with Spark Shell

I am trying to use the Spark shell to connect to an Accumulo table. I load Spark and the libraries I need like this: $ bin/spark-shell --jars…

apache-spark kryo accumulo

asked Jul 18 '16 at 16:36

Martin

2 answers

How to restore an instance in Accumulo 1.7.0?

I configured Accumulo 1.7.0 with Hadoop 2.6.0 (HDFS) and ZooKeeper 3.4.6. Everything works well, but I want to know how to restore an instance. Thanks! UPDATE: The problem is that I want to recover the instance after restarting the PC or stopping all…

hdfs apache-zookeeper accumulo

asked Feb 17 '16 at 16:27

WilD

1 answer

Connection to Accumulo just hangs on initialization

I have Accumulo running in a VM. This Groovy script will connect just fine from within the VM, but outside of the VM it hangs at the first println statement. String instance = "test" String zkServers = "localhost:2181" String principal =…

java hadoop groovy accumulo

asked Dec 03 '15 at 20:30

Mike Thomsen

1 answer

Is Accumulo TableOperations.createTable synchronous?

I'm working with Accumulo 1.6.2 and I am seeing an issue where it appears that TableOperations.createTable may return without an exception, but the table still isn't ready. For example in the following code: String tableName = "foo"; TableOperations…

java accumulo

asked Jun 25 '15 at 19:14

Jon

1 answer

Why is my executor memory usage stuck at 0?

I have a pretty simple Spark job that looks like this: JavaPairRDD rawData = newAccumuloRDD(...); JavaPairRDD indexSrc = rawData.filter(new IndexFilter()).cache(); JavaPairRDD indexEntries = …

apache-spark accumulo

asked May 13 '15 at 23:38

Russ Weeks
