Questions tagged [apache-samza]

Apache Samza is a distributed stream processing framework.

Apache Samza is a distributed stream processing framework.

It uses Apache Kafka for messaging, and Apache Hadoop YARN to provide fault tolerance, processor isolation, security, and resource management.

It has support for stateful stream processing natively.

Apache Samza is a top level project of the Apache Software Foundation.

82 questions
0
votes
1 answer

Apache Samza aggregation rules for missing expected events in rolling time-period

My use-case is to identify entities from which expected events have not been received after X amount of time in real-time. For example: If we have received PaymentInitiated event at time T but didn't receive either of PaymentFailed / PaymentAborted…
Harish
  • 7,589
  • 10
  • 36
  • 47
0
votes
1 answer

Apache Storm aggregation rules for missing expected events in rolling time-period

My use-case is to identify entities from which expected events have not been received after X amount of time in real-time rather than using batch jobs. For Example: If we have received PaymentInitiated event at time T but didn't receive either of…
0
votes
1 answer

Testing Samza with RocksDB application with SBT

I would like to run a Samza (using RocksDB KV store) application from SBT. When I do ./sbt "run " I receive the following error java.lang.ExceptionInInitializerError (snip) Caused by: java.lang.RuntimeException: librocksdbjni-linux64.so was not…
Luís Guilherme
  • 2,620
  • 6
  • 26
  • 41
-1
votes
1 answer

Samza Failed to send message. Exception

I'm using samza on aws emr instances, and I always have exception like follows, can anyone help me?: org.apache.samza.SamzaException: Failed to send message. Exception: java.lang.IllegalStateException: Cannot send after the producer is closed. …
justr
  • 11
  • 1
  • 4
-2
votes
2 answers

Samza build with gradle failing

Samza build with gradle is failing while executing testing with on samza-core. Below is the exception I am facing: Please share your thoughts... org.apache.samza.config.factories.TestPropertiesConfigFactory > testCanReadPropertiesConfigFiles…
Raja
  • 513
  • 5
  • 18
-3
votes
2 answers

Relevance of Hadoop & Streaming solutions when Spark exists

I am starting a big data initiative for my startup. In 2018 is there any reason to use Hadoop at all since Spark is touted to be way faster due to it primarily not writing the intermediate data to disk as Hadoop’s MR. I realize Spark has a higher…
-4
votes
1 answer

How to process 1.2 million messages per second using apache samza?

Now i made basic setup of samza . After that i process 1,00,000 json records which is from kafka broker topic Each records in size of 100 bytes But per second i can only process 11000 records. How to increase processing speed?
1 2 3 4 5
6