Questions tagged [accumulo]

The Apache Accumulo sorted, distributed key/value store is based on Google's BigTable design. It is built on top of Apache Hadoop, ZooKeeper, and Thrift. It features a few novel improvements on the BigTable design in the form of cell-level access labels and a server-side programming mechanism that can modify key/value pairs at various points in the data management process.

The Apache Accumulo sorted, distributed key/value store is a robust, scalable, high performance data storage and retrieval system. Apache Accumulo is based on Google's BigTable design and is built on top of Apache Hadoop, ZooKeeper, and Thrift. Apache Accumulo features a few novel improvements on the BigTable design in the form of cell-based access control and a server-side programming mechanism that can modify key/value pairs at various points in the data management process. Other notable improvements and feature are outlined here.

Google published the design of BigTable in 2006. Several other open source projects have implemented aspects of this design including HBase, Hypertable, and Cassandra. Accumulo began its development in 2008 and joined the Apache community in 2011.

Official Website: https://accumulo.apache.org/

268 questions
2
votes
0 answers

Accumulo custom Filter is not getting called correctly on scan

I have written a custom Filter which is pretty much just a ripoff of the provided AgeOffFilter with some extra stuff. My goal is to remove entries from Accumulo based on a custom time field. public class CustomAgeOffFilter extends Filter { It has…
Smittey
  • 2,475
  • 10
  • 28
  • 35
2
votes
1 answer

Multiple table input for mapreduce

I am thinking of doing a mapreduce using accumulo tables as input. Is there a way to have 2 different tables as input, the same way it exists for the multiple files input like addInputPath ? Or is it possible to have one input from a file and the…
user3794667384
  • 437
  • 7
  • 23
2
votes
2 answers

Spark streaming + Accumulo - Serialize BatchWriterImpl

I'm looking for Spark Streaming + Accumulo connectors and full usage example. Currently I'm trying to write Spark Streaming results to Accumulo table but I'm getting NotSerializableException for BatchWriter. Can someone point me to examples of how…
Marcin
  • 444
  • 3
  • 14
2
votes
2 answers

Using GeoMesa Native API to insert data in accumulo

I am trying to insert and read data from accumulo storage using GeoMesa Native API. I have created a class file to use geomesa accumulo storage natively. Here is my java code : package org.locationtech.geomesa.api; import…
Suresh Prajapati
  • 3,991
  • 5
  • 26
  • 38
2
votes
1 answer

GeoMesa - Differences between the supported Data Stores?

I am skimming through the documentation of GeoMesa and the table of contents exposes a number of supported data stores that can be used: Specific back-end implementations are described in the following chapters: Accumulo Data Store Kafka…
j9dy
  • 2,029
  • 3
  • 25
  • 39
2
votes
1 answer

GeoMesa export query error

Problem: When I try to run below query it gives error. Is there something missing in the CQL query or is a jar file missing ? Also whether it is just a GeoMesa error or accumulo error. Is there any other way to query GeoMesa and get results in a CSV…
kaxil
  • 17,706
  • 2
  • 59
  • 78
2
votes
1 answer

How to query a gemesa-accumulo feature using command line

I ingested data in geomesa accumulo using sfts and converters, Data was ingested successfully and i can visualise the same data using geoserver plugin. I want to filter feature data using command line but however not able to find any commands to do…
Suresh Prajapati
  • 3,991
  • 5
  • 26
  • 38
2
votes
1 answer

Start-all.sh for Accumulo throws multiple warnings about: Max open files on localhost

I am getting multiple warnings about: WARN : Max open files on localhost is 1024, recommend 32768 I am unsure what this is about. /usr/local/accumulo$ ./bin/start-all.sh & [1] 27142 /usr/local/accumulo$ Starting monitor on localhost WARN : Max open…
jnbdz
  • 4,863
  • 9
  • 51
  • 93
2
votes
1 answer

Iterating over accumulo table with scala

I have a table in accumulo named records with a few families and qualifiers per row_id, it looks like this in accumulo shell. michaelp@accumulo records> scan 2016-10-17 16:27:55,359 [Shell.audit] INFO : michaelp@accumulo records> scan E001…
Mike
  • 6,813
  • 4
  • 29
  • 50
2
votes
3 answers

How do you debug accumulo iterators and filters?

I am using logging for debugging, however it's extremely inefficient. What other approaches are there?
kgu87
  • 2,050
  • 14
  • 12
2
votes
1 answer

Read from Accumulo with Spark Shell

I try to use the spark shell to connect to an Accumulo Table I load spark and the libraries I need like this: $ bin/spark-shell --jars…
Martin
  • 659
  • 7
  • 14
2
votes
2 answers

How to restore an instance in Accumulo 1.7.0?

I configured Accumulo 1.7.0 with Hadoop 2.6.0 (HDFS) and Zookeeper 3.4.6, all works good, but i want to know how to restore an instance. Thanks !!! UPDATE The problem is that i want to recover the instance after restart the PC or stop all…
WilD
  • 57
  • 9
2
votes
1 answer

Connection to Accumulo just hangs on initialization

I have Accumulo running in a VM. This Groovy script will connect just fine from within the VM, but outside of the VM it hangs at the first println statement. String instance = "test" String zkServers = "localhost:2181" String principal =…
Mike Thomsen
  • 36,828
  • 10
  • 60
  • 83
2
votes
1 answer

Is Accumulo TableOperations.createTable synchronous?

I'm working with Accumulo 1.6.2 and I am seeing an issue where it appears that TableOperations.createTable may return without an exception, but the table still isn't ready. For example in the following code: String tableName = "foo"; TableOperations…
Jon
  • 3,212
  • 32
  • 35
2
votes
1 answer

Why is my executor memory usage stuck at 0?

I have a pretty simple Spark job that looks like this: JavaPairRDD rawData = newAccumuloRDD(...); JavaPairRDD indexSrc = rawData.filter(new IndexFilter()).cache(); JavaPairRDD indexEntries = …
Russ Weeks
  • 363
  • 1
  • 11
1 2
3
17 18