Questions tagged [druid]

Druid is a column-oriented open-source distributed data store written in Java.

According to the Apache Druid website:

Apache Druid is a real-time analytics database designed for fast slice-and-dice analytics ("OLAP" queries) on large data sets. Most often, Druid powers use cases where real-time ingestion, fast query performance, and high uptime are important.

Druid is commonly used as the database backend for GUIs of analytical applications, or for highly-concurrent APIs that need fast aggregations. Druid works best with event-oriented data.

597 questions
0
votes
1 answer

Feed druid with data from graylog2 server

Dears I need to get data from graylog2 server into druid (e.g. CPU, memory, disk utilisation of several machines). I've searched for plugins at the graylog marketplace and tranquility documentation, and I did not found any solution to retrieve data…
0
votes
1 answer

Druid RabbitMQ Firehose

I'm trying to setup druid to work with rabbitmq firehose but getting the following error from Tranquility java.lang.IllegalArgumentException: Could not resolve type id 'rabbitmq' into a subtype of [simple type, class…
bbox
  • 1
  • 3
0
votes
1 answer

druid kafka ingestion failed when parsing data using javascript

I have a specific format of messages in kafka. They are caret ^ separated. Now to parse this using tranquility I am using below parse spec. "parser" : { "type" : "string", "parseSpec" : { "timestampSpec"…
Shades88
  • 7,934
  • 22
  • 88
  • 130
0
votes
1 answer

an error loading batch data in druid with Hadoop 2.6.0-cdh5.7.1 (map-reduce)

I constructed the druid,and get error when loading batch data in it, as follows: 2016-10-19T07:06:08,656 INFO [task-runner-0-priority-0] org.apache.hadoop.mapreduce.Job - Counters: 12 Job Counters Failed map tasks=4 …
Carl Yan
  • 3
  • 3
0
votes
0 answers

Druid / Tranquility (server) / Ingestion / Indexing has not finished

I use Druid 0.9.1.1 & Tranquility 0.8.0, and I followed the quickstart steps here: http://druid.io/docs/0.9.1.1/tutorials/quickstart.html The following command succeed: bin/generate-example-metrics | curl -XPOST -H'Content-Type: application/json'…
Cokorda Raka
  • 4,375
  • 6
  • 36
  • 54
0
votes
1 answer

UnresolvedAddressException being hit by Druid broker

I'm new to Druid. I've set up a cluster using the Imply IAP package. Events are being ingested into Druid from Kafka using Tranquility. I can see that the events are being accepted by Tranquility - the following log statements are what I used to…
jithinpt
  • 1,204
  • 2
  • 16
  • 33
0
votes
1 answer

What's the optimal way to run Druid on DC/OS?

What's the best ways to run Druid on DC/OS? I see there are 2 possible options; Marathon and Docker. Which one will you recommend? Is there a better way?
0
votes
1 answer

Comparison effective performance query topN between segmentGranularity by day and by hour

I asked my question on https://groups.google.com/forum/#!topic/druid-user/SYWcqcr504k But no one help me to get around this. I am processing on large dataset. With the topN query on 2 case (segmentGranularity by day vs segmentGranularity by hour)…
VanThaoNguyen
  • 792
  • 9
  • 20
0
votes
1 answer

Modeling data in Hive tables in Caravel

I am curious if there is a way to model the data from Hive tables in Caravel. Similar to how caravel can connect to a Druid cluster and display a UI on the data in Druid, is there a way to model the data in Hive in Caravel using JDBC?
user5228393
0
votes
1 answer

druid schema concept - using multiple datasources or parsers in same spec file

I have the following scenario, 1 Apache Kafka topic where multiple types of events are pushed in. Druid will pick up form this topic and aggregate based on timestamp. Say for example the below are the messages in kafka topic, type 1, {"timestamp" :…
Krishna
  • 3
  • 4
0
votes
1 answer

How should a TSV file be formatted in DRUID?

I am curious how a TSV file should look when we are ingesting data from a local TSV file using DRUID. Should it just be like: Please note this is just for testing: quickstart/sample_data.tsv file: name lastname email time Bob Jones …
LP45
  • 303
  • 3
  • 14
0
votes
1 answer

Hadoop - Kerberos Autentication

I am configuring HDFS as deep storage for Druid and trying to figure out a way to pass kerberos keytab file for authentication. Is there a Java System argument to pass the keytab file location?
jagamot
  • 5,348
  • 18
  • 59
  • 96
0
votes
1 answer

Is aggregation (count) on dimension but not on metrics supported by Druid

For example there are two dimensions: [country, website] and one metric: [PV]. I want to know the average PV of website for each country. To make it, it's easy to get the total PV in each country, however it's difficult to get the count of website…
Junjie
  • 1,145
  • 3
  • 21
  • 37
0
votes
1 answer

Select top N from my_table in druid.io - a simplest request - how?

How can I request a few "records" from Druid without using any search/filter criteria using as simple as possible request? I relational db I'd do that by select top 10 from my_table. The reason I want to do that is to make sure data exists and see…
imatahi
  • 591
  • 1
  • 8
  • 16
0
votes
1 answer

java.nio.channels.UnresolvedAddressException when running druid sample app

I am trying out druid.io with zookeeper 34.6 on fedora 20 x64. U am following tut [here]( http://druid.io/docs/latest/Tutorial:-A-First-Look-at-Druid.html) After huge effort I am able to run zookeeper. Now when I run the server of sample druid…
Mahesha999
  • 22,693
  • 29
  • 116
  • 189