Questions tagged [flink-sql]

Apache Flink features two relational APIs, SQL and Table API, as unified APIs for stream and batch processing.

Apache Flink features two relational APIs:

  1. SQL (via Apache Calcite)
  2. Table API, a language-integrated query (LINQ) interface

Both APIs are unified APIs for stream and batch processing. This means that a query returns the same result regardless of whether it is applied to a static data set or a data stream. Queries from both APIs are parsed and optimized by Apache Calcite.

Both APIs are tightly integrated with Flink's DataStream and DataSet APIs.

667 questions
1
vote
1 answer

Adding a column in Flink table

I'm trying to add a new column to a flink table in Java Table table = tEnv.sqlQuery(query.getQuery()); table = table.addColumns($("NewColumn")); but I'm running into this ValidationException: org.apache.flink.table.api.ValidationException: Cannot…
shepherd
  • 33
  • 3
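A likely cause, assuming the intent is to add a genuinely new column: `addColumns` expects an expression to compute, while `$("NewColumn")` is a reference to a column that does not yet exist, hence the ValidationException. In plain Flink SQL the same effect can be sketched like this (table name and constant value are hypothetical):

```sql
-- Add a new constant-valued column alongside all existing ones
SELECT *, 'defaultValue' AS NewColumn
FROM MyTable
```

In the Table API the expression would similarly need to produce a value, e.g. a literal or a computation over existing columns, aliased with `.as("NewColumn")`.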
1
vote
0 answers

Flink listener is not getting called when using statementSet.execute

I'm using flink 1.13, using statementSet.execute but added a listener in Flink stream env, onJobSubmitted is getting called when the job is submitted (no compile issues with plan) but to bug the pipeline, I have a string field in Kafka but int…
1
vote
0 answers

Flink SQL: Unsupported type(ARRAY) to generate hash code

I am trying to use flink sql to load avro data and perform various operations. One field of the original data has the Array type, and no matter what operations I want to do, like very simply Table result = inputTable.where(or($("status").isNull(),…
tottistar
  • 11
  • 3
1
vote
1 answer

Flink CEP sql restrict output

I have a use case where I have 2 input topics in kafka. Topic schema: eventName, ingestion_time(will be used as watermark), orderType, orderCountry Data for first topic: {"eventName": "orderCreated", "userId":123, "ingestionTime": "1665042169543",…
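Restricting CEP output in Flink SQL is usually done inside `MATCH_RECOGNIZE`: the `MEASURES` clause picks which fields come out, and `ONE ROW PER MATCH` limits the result to a single row per matched pattern. A sketch, assuming the two topics are unioned into one table `orders` with a watermark on `ingestionTime`, and assuming a hypothetical second event name `orderCompleted`:

```sql
SELECT *
FROM orders
    MATCH_RECOGNIZE (
        PARTITION BY userId
        ORDER BY ingestionTime
        MEASURES
            A.ingestionTime AS createdAt,   -- only the measured fields are emitted
            B.ingestionTime AS completedAt
        ONE ROW PER MATCH                   -- one output row per matched sequence
        AFTER MATCH SKIP PAST LAST ROW
        PATTERN (A B)
        DEFINE
            A AS A.eventName = 'orderCreated',
            B AS B.eventName = 'orderCompleted'
    )
```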
1
vote
1 answer

Flink Windows - how to emit intermediate results as soon as new event comes in?

Flink 1.14, Java, Table API + DataStream API (toDataStream/toAppendStream). I'm trying to: read events from Kafka, hourly aggregate (sum, count, etc.) and upsert results to Cassandra as soon as new events are coming, in other words — create new…
deeplay
  • 376
  • 3
  • 20
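One common way to get per-event updates instead of waiting for the window to close is to avoid a windowed aggregation entirely and group by a computed hour bucket: the result is an updating table, and an upsert sink (e.g. Cassandra keyed on the bucket) receives a new version on every incoming event. A sketch with hypothetical table and column names:

```sql
-- Emits an updated row for the current hour bucket on every new event
SELECT
  DATE_FORMAT(event_time, 'yyyy-MM-dd HH:00') AS hour_bucket,
  SUM(amount) AS total,
  COUNT(*)    AS cnt
FROM events
GROUP BY DATE_FORMAT(event_time, 'yyyy-MM-dd HH:00')
```

Note this trades away window-based state cleanup, so an idle-state retention TTL is usually configured alongside it.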
1
vote
1 answer

Unforeseeable Tombstones messages when joining with Flink SQL

We've a SQL Flink Job (Table API) that reads Offers from a Kafka topic (8 partitions) as source and sinks it back to another Kafka topic after some aggregations with other data sources to calculate the cheapest one and aggregate extra data over that…
1
vote
0 answers

Flink SQL : How to unpack fields in ROW type as multiple columns?

I call a UDF in such a Flink SQL query: SELECT dvid, rank_name, rank_type, window_start, window_end, RankDif(rank_order,rank_pt) AS rank_cur FROM TABLE( HOP(TABLE UniqueRankTable, DESCRIPTOR(rank_pt), INTERVAL '1' DAY, INTERVAL '2'…
Singleton
  • 11
  • 1
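If the UDF returns a `ROW` type, Flink SQL can address its fields with dot notation, so the unpacking can be done in an outer projection. A sketch, assuming `rank_cur` is `ROW<first_field INT, second_field INT>` (field names are hypothetical):

```sql
-- Project individual ROW fields out as top-level columns
SELECT
  dvid,
  rank_name,
  rank_cur.first_field  AS rank_value,
  rank_cur.second_field AS rank_delta
FROM ranked_results
```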
1
vote
1 answer

How to read data from HDFS with Flink in python

I want to read data from HDFS with Flink in python I found it possible with Java or Scala : https://nightlies.apache.org/flink/flink-docs-release-1.15/docs/connectors/dataset/formats/hadoop/ Indeed, Flink HDFS connector provides a Sink that writes…
Zak_Stack
  • 103
  • 8
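The DataSet Hadoop formats linked above are not exposed in PyFlink, but the Table API `filesystem` connector can read from an `hdfs://` path when the Hadoop classpath is available to Flink. A sketch of a source table definition (schema, host, path, and format are hypothetical) that could be registered from PyFlink via `t_env.execute_sql(...)`:

```sql
CREATE TABLE hdfs_source (
  `word` STRING,
  `cnt`  BIGINT
) WITH (
  'connector' = 'filesystem',
  'path'      = 'hdfs://namenode:8020/path/to/data',
  'format'    = 'csv'
)
```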
1
vote
2 answers

Is it better to use Row or GenericRowData with DataStream API?

I am working with Flink 1.15.2. Should I use Row, or GenericRowData (which implements RowData), for my own data type? I mostly use the streaming API. Thanks. Sig.
erich
  • 71
  • 6
1
vote
1 answer

Flink SQL Watermark Strategy After Join Operation

My problem is that I cannot use the ORDER BY clause after the JOIN operation. To reproduce the problem, CREATE TABLE stack ( id INT PRIMARY KEY, ts TIMESTAMP(3), WATERMARK FOR ts AS ts - INTERVAL '1' SECONDS ) WITH ( 'connector' =…
1
vote
1 answer

Is there a Flink Table API equivalent to Window Functions using row_number(), rank(), dense_rank()?

In an attempt to discover the possibilities and limitations of the Flink Table API for use in a current project, I was trying to translate a Flink SQL statement into its equivalent Flink Table API version. For most parts, I am able to translate the…
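For reference, the SQL side of this is the Top-N pattern: a `ROW_NUMBER()` over-window in a subquery, filtered on the row number, which Flink's planner recognizes and optimizes specially. A sketch with hypothetical table and column names:

```sql
-- Top-N pattern: Flink recognizes this subquery + row_num filter shape
SELECT *
FROM (
  SELECT
    product_id,
    price,
    ROW_NUMBER() OVER (PARTITION BY product_id ORDER BY price) AS row_num
  FROM products
)
WHERE row_num <= 3
```

The Table API supports over-windows for aggregates, but the ranking functions are most reliably expressed through SQL as above.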
1
vote
0 answers

Correlated Subquery in Flink

I want to join two tables (left and right) which are generated by below queries. CREATE TABLE left_table ( `experiment_id` BIGINT, `f_sequence` BIGINT, `line_string` STRING, `log_time` TIMESTAMP(3), WATERMARK FOR log_time AS…
akurmustafa
  • 122
  • 10
1
vote
2 answers

How to determine number of task slots in flink

I am trying to determine how to divide the task slots for my Flink job. To be more specific, is there a reason to use 2 task slots (or more) per task manager instead of one task slot per task manager? I read that multiple task slots per task manager…
JoeHills
  • 43
  • 4
1
vote
0 answers

In Flink table API, how do you use postgres timestamps in scan.partition.column scan.partition.lower-bound etc

In Flink 1.13, how do you configure a CREATE TABLE statement to use a postgres timestamp column to partition by? Things I have tried: In postgres, I have a column named 'my_timestamp' of type TIMESTAMP WITHOUT TIME ZONE In my Flink CREATE TABLE…
Jordan Morris
  • 2,101
  • 2
  • 24
  • 41
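The JDBC connector's partitioned scan accepts numeric, date, or timestamp columns. A sketch of the `WITH` options (URL, table, bounds, and partition count are hypothetical; the accepted format of the bound values varies by connector version, so verify against the docs for the JDBC connector version in use):

```sql
CREATE TABLE pg_source (
  id BIGINT,
  my_timestamp TIMESTAMP(3)
) WITH (
  'connector' = 'jdbc',
  'url'       = 'jdbc:postgresql://localhost:5432/mydb',
  'table-name' = 'my_table',
  'scan.partition.column'      = 'my_timestamp',
  'scan.partition.num'         = '10',
  'scan.partition.lower-bound' = '2021-01-01 00:00:00',
  'scan.partition.upper-bound' = '2021-12-31 23:59:59'
)
```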
1
vote
1 answer

use Flink SQL multiple case when

I am using Flink SQL to generate an explain plan: select case when count(*)>1 then '11' end as query, case when src_ip='6' then '22' end as query from table. But I got an exception saying Expression 'src_ip' is not being grouped. When I alter count(*) to…
jd g
  • 13
  • 2
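The error itself is standard SQL semantics: once an aggregate like `COUNT(*)` appears, every non-aggregated column in the SELECT list must appear in `GROUP BY` (or be wrapped in an aggregate such as `MAX`). A sketch of a grouped version, with a hypothetical table name:

```sql
-- src_ip is referenced outside an aggregate, so it must be grouped
SELECT
  CASE WHEN COUNT(*) > 1 THEN '11' END AS query1,
  CASE WHEN src_ip = '6' THEN '22' END AS query2
FROM my_table
GROUP BY src_ip
```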