Questions tagged [amazon-kinesis-analytics]

Amazon Kinesis Data Analytics is the way to analyze streaming data, gain actionable insights, and respond in real time. SQL users can query streaming data or build entire streaming applications using templates and an interactive SQL editor. Java developers can build streaming applications using open source Java libraries and AWS integrations to transform and analyze data in real-time.

133 questions

votes

0 answers

AWS Kinesis Data Analytics (Apache Flink java app) crashing randomly on Kinesis inputstream RuntimeException

I have three different Apache Flink applications running on 3 different AWS kinesis data analytics which are consuming data from the same kinesis input stream. After a while I can see all the flink randomly crashed and restarts with the error…

apache-flink amazon-kinesis amazon-kinesis-analytics

asked Sep 22 '22 at 02:06

Avik Das

votes

1 answer

Kinesis data stream skipping records with exception at downstream

I have an application with the following set up. Kinesis data stream (retention period: 1day) -> StreamExecutionEnvironment .getExecutionEnvironment() .addSource(new FlinkKinesisConsumer) .map(new MapFunction()) .addSink(); When the MapFunction…

apache-flink amazon-kinesis amazon-kinesis-analytics

asked Jul 24 '22 at 05:15

Brian

votes

1 answer

kinesis data stream performance testing with partition key

I am using the Kinesis Data Generator tool and I was wondering how to define the partition key in the test data so that the data is distributed to all the shard evenly. https://awslabs.github.io/amazon-kinesis-data-generator/web/producer.html

amazon-kinesis amazon-kinesis-analytics

asked Jul 05 '22 at 06:29

zimmerdimmer

votes

1 answer

Cannot connect Flink to Elasticache Redis cluster - FlinkJedisClusterConfig unable to parse cport in CLUSTER NODES response

How can I use an Elasticache Redis Replication Group as a data sink in Flink for Kinesis Analytics? I have created an Elasticache Redis Replication Group, and would like to compute something in Flink and store the results in this group. My Java…

redis apache-flink jedis amazon-elasticache amazon-kinesis-analytics

asked Jun 14 '22 at 16:02

Jake

votes

1 answer

ClassNotFoundException while running jar on Amazon Kinesis Streaming Analytics app

I have created a Kinesis Analytics Streaming Application in SpringBoot which will consume messages from the AmazonKinesis input stream and will do some operations on top of it using the Apache Flink DataStream library. When, I am uploading the…

java spring apache-flink amazon-kinesis amazon-kinesis-analytics

asked May 27 '22 at 13:33

Jay

votes

2 answers

how to join two data streams along with sliding window function in Flink Table API?

I have two streaming tables from two Kafka topic and I want to join these streams and perform aggregate function on the data joined. Streams need to be joined using sliding window. On joining and windowing the data, I am getting an error Rowtime…

apache-flink flink-streaming apache-zeppelin flink-sql amazon-kinesis-analytics

asked May 24 '22 at 21:49

data_adi

votes

1 answer

Why does my watermark not advance in my Apache Flink keyed stream?

I am currently using Apache Flink 1.13.2 with Java for my streaming application. I am using a keyed function with no window function. I have implemented a watermark strategy and autoWatermarkInterval config per the documentation, although my…

java apache-flink flink-streaming amazon-kinesis-analytics

asked May 23 '22 at 12:48

Ryan

votes

1 answer

Recalculate historical data using Apache Beam

I have an Apache Beam streaming project that calculates data and writes it to the database, what is the best way to reprocess all historical records after a bug fix or after changing the way it processes data without a big delay?

bigdata apache-flink apache-beam dataflow amazon-kinesis-analytics

asked May 13 '22 at 14:18

Oleksandr Vetoshkin

votes

1 answer

Combine two keys in Apache Beam

I have an Apache Beam streaming project that uses Combine.perKey(), I need to be able to merge entities from my admin tool (to point one entity to another one), how to combine two keys with calculated data in Beam? It's easy to do it for the new…

bigdata apache-flink apache-beam dataflow amazon-kinesis-analytics

asked May 13 '22 at 14:10

Oleksandr Vetoshkin

votes

1 answer

I have configured my Flink Application using PyFlink, but I want to change the Job Name

I have configured Amazon Kinesis Data Analytic using PyFlink, but I want to change the Job Name to whatever I want. How can I do this?

amazon-kinesis pyflink amazon-kinesis-analytics

asked Mar 28 '22 at 03:00

Jongmin Park

votes

1 answer

Kinesis Firehose Lambda Transformation and Dynamic partition

The following data presented is from the faker library. i am trying to learn and implement dynamic partition in kinesis Firehose Sample payload Input { "name":"Dr. Nancy Mcmillan", "phone_numbers":"8XXXXX", "city":"Priscillaport", …

python amazon-kinesis amazon-kinesis-firehose amazon-kinesis-analytics amazon-kinesis-video-streams

asked Mar 26 '22 at 14:09

Soumil Nitin Shah

votes

1 answer

Apache Flink StreamingFileSink making several HEAD requests while writing to S3 which causes ratelimiting

I have an Apache Flink application that I have deployed on Kinesis Data analytics. This application reads from Kafka and writes to S3. The S3 bucket structure it writes to is computed using a BucketAssigner.A stripped down version of the…

amazon-s3 hadoop apache-flink amazon-kinesis-analytics

asked Mar 17 '22 at 00:49

Vinod Mohanan

3,729
2
17
25

votes

0 answers

Heavy back pressure and huge checkpoint size

I have an Apache Flink application that I have deployed on Kinesis Data analytics. Payload schema processed by the application (simplified version): { id:String= uuid (each request gets one), category:string= uuid (we have 10 of…

apache-flink flink-streaming amazon-kinesis-analytics

asked Mar 15 '22 at 00:47

Vinod Mohanan

3,729
2
17
25

votes

2 answers

How to update/refresh a parameter in Flink application

I have a Flink application on AWS Kinesis Analytics service. I need to filter some values on a data stream based on a threshold. Also, I'm passing the threshold parameter using AWS Systems Manager Parameter Store service. For now, I got this: In my…

scala apache-flink amazon-kinesis-analytics

asked Jan 27 '22 at 18:49

Felipe Jorquera Uribe

votes

1 answer

Flink - DynamoDB source

I'm new working with real-time applications. Currently, I'm using AWS Kinesis/Flink and Scala I have the following architecture: old architecture As you can see I consume a CSV file using CSVTableSource. Unfortunately, the CSV file became too big…

scala amazon-dynamodb apache-flink amazon-kinesis-analytics

asked Jan 06 '22 at 21:53

Felipe Jorquera Uribe

Prev 1 2 3

…

8 9 Next