Questions tagged [camus]

LinkedIn's Kafka to HDFS pipeline.

An API for pulling data from Kafka into HDFS. It fetches events from the available topics and stores them by topic. It is also responsible for collecting event-count statistics.

https://github.com/linkedin/camus/wiki/Camus-Overview

19 questions
0
votes
1 answer

Passing properties file to Oozie Java Action

I have an Oozie Java action workflow set up, which I plan to schedule with an Oozie coordinator. The Java action runs a Camus job, whose jar and properties config file I have put in the workflow/lib directory. Any ideas on how I would pass the -P…
Jake Chase
  • 27
  • 1
  • 5
0
votes
1 answer

java.lang.ClassNotFoundException when trying to run camus

I downloaded the Confluent package, which includes the Camus jars, and followed the instructions online. Hadoop is properly set up (meaning I can use hadoop fs -ls commands and other hadoop jar commands). However, when I…
JudyJiang
  • 2,207
  • 6
  • 27
  • 47
0
votes
1 answer

Prevent camus from increasing the offset value

Is there any way to prevent Camus from increasing the offset value, so that it always reads the previous value? Also, how can I provide a custom name for the file that gets created, instead of the general format, i.e.…
-1
votes
1 answer

camus or gobblin which is preferable

Can you please help me set up Camus or Gobblin to store messages from Kafka in HDFS? A working example would be great. Gobblin is still in the incubation phase and Camus is phased out, so which one is preferable? I downloaded Gobblin and…
VIJ
  • 1,516
  • 1
  • 18
  • 34