Highest Voted 'spark-avro' Questions

0

votes

0 answers

How to read Avro file in spark-shell in Spark2.4.8?

I want help from several people. We are facing problem while reading avro file in spark2-shell in Spark2.4 Any pointers will be of great help. The cause of the error could not be found. $spark-shell --jars…

asked Sep 05 '22 at 09:16

jsk

3
3

0

votes

0 answers

Unable to deploy Avro Spark. Access denied

I am training to work with spark. Parquet and csv in Jupyter work correctly. When I started trying the avro format, this error appeared. Screen 1 Here they offer a solution. Apache Avro Data Source Guide ./bin/spark-submit --packages…

apache-spark pyspark avro spark-avro

asked Jul 04 '22 at 02:06

Decurro

1
2

0

votes

0 answers

Spark-avro Cannot grow BufferHolder because the size is negative - where to look for the cause?

Environment: Scala 2.11 Spark 2.4 Hortownorks SchemaRegistry Kafka messages with embedded schema information. Context As stated above, I am aware of how Hortonworks SchemaRegistry information is embedded in the Kafka message. First 13 bytes of…

scala apache-spark apache-kafka spark-avro hortonworks-dataflow

asked Jun 15 '22 at 12:28

napkin-pumpkin

1
1

0

votes

1 answer

Installing Apache Spark Packages to run Locally

I am looking for a clear guide or steps to installing Spark packages (specifically spark-avro) to run locally and correctly using them with spark-submit command. I've spent a lot of time reading many posts and guides, but still not able to get…

apache-spark pyspark apache-spark-sql spark-avro

asked May 17 '22 at 14:59

bda

372
1
7
22

0

votes

1 answer

How to Deseralize Avro response getting from Datastream Scala + apache Flink

I am Getting Avro Response from a Kafka Topic from Confluent and i am facing issues when i want to deseralize the response. Not Understanding the Syntax How i should define the Avro deserializer and use in my Kafka Source while reading. Sharing the…

scala apache-flink avro flink-streaming spark-avro

asked May 06 '22 at 06:26

Vishist Bhoopalam

79
5

0

votes

1 answer

AVRO file not read fully by Spark

I am reading AVRO file stored on ADLS gen2 using Spark as following: import dbutils as dbutils from pyspark.conf import SparkConf from pyspark.sql import…

apache-spark avro spark-avro

asked Nov 15 '21 at 08:15

RRM

2,495
29
46

0

votes

0 answers

Can not read AVRO data from kafka stream in spark scala app

I have kafka topic with simple avro serialized data in it and I am trying to read this data in my spark app which is on scala. When I print spark Dataframe to console, I can see that there are issues with desterilizing (or smth else) because my…

apache-spark apache-kafka spark-structured-streaming spark-avro

asked Nov 06 '21 at 19:13

Illia

45
6

0

votes

1 answer

How to use spark_read_avro from sparklyr R package?

I'm using: R version 4.1.1 sparklyr version ‘1.7.2’ I'm connected to my databricks cluster with databricks-connect and trying to read an avro file using the following code: library(sparklyr) library(dplyr) sc <- spark_connect( method =…

r databricks avro sparklyr spark-avro

asked Oct 12 '21 at 15:36

Anci

11
3

0

votes

1 answer

Spark Batch Avro Deserialization: Malformed data. Length is negative

I am doing some batch processing on Kafka through Spark. The record as serialized as Avro. I am trying to deserialize the value using the exact schema in the message itself but am getting a malformed record exception. Here's my code: …

apache-spark avro confluent-schema-registry spark-avro

asked Aug 30 '21 at 19:44

Prashant Pandey

4,332
3
26
44

0

votes

1 answer

Importing Spark avro packages into a dockerized python project to import avro file in S3

I am trying to read some avro files stored in S3 bucket with the following code. spark version is 2.4.7 from pyspark.sql import SparkSession spark = SparkSession.builder.appName('Statistics').getOrCreate() sc = spark.sparkContext df =…

python apache-spark amazon-s3 avro spark-avro

asked Jul 09 '21 at 17:16

tharindu

513
6
26

0

votes

0 answers

Fetching avro data from kafka using spark

I tried to publish records from a dataframe built from an avro file while it is built from a CSV file using dataframe. I published the data into a kafka topic in avro format using to_avro(struct(*)) from the dataframe, I was able to view the binary…

python dataframe apache-spark apache-kafka spark-avro

asked Jun 21 '21 at 12:38

Aravind

1

0

votes

1 answer

Create AVRO File AWS Glue Dynamic Frame One to Many Join

Is the following behavior possible in AWS Glue? I am trying to create a single AVRO file by joining two DynamicFrames in a one-to-many fasion. For example I have a DyF with many Teacher types: teacher_id teacher_name and a Dyf with many Student…

python amazon-web-services etl aws-glue spark-avro

asked May 12 '21 at 23:07

Zachm

1

0

votes

1 answer

Convert dataset to dataframe from an avro file

I wrote a scala script to load an avro file, and to work with the generated data (to retrieve top contributors). The problem is that while loading the file it gives a dataset that i can not convert to dataframe cuz it contains some complex types: …

scala dataframe apache-spark spark-avro

asked May 01 '21 at 15:58

Issibra

79
1
10

0

votes

1 answer

AvroDeserialisation Failing when deriving a col using sum but is successful when the same column is derived using count.Serialised data is in kafka

Here is my SQL which works : select hostnetworkid,roamertype,carrierid, total_failure,total_count,date_format(timestamp(unix_timestamp(window.start)),\"yyyyMMdd\") as eventdate, date_format(timestamp(unix_timestamp(window.start)),\"HH:mm\") as…

apache-spark apache-kafka avro spark-avro

asked Apr 01 '21 at 08:18

kushagra deep

462
6
12

0

votes

1 answer

Avro schema ( .avsc ) enforcement in Pyspark

Can anyone help me with reading a avro schema (.avsc ) through Pyspark and enforcing it while writing the dataframe to a target storage ? All my targetr table schemas are provided as .avsc files and I need to provide this custom schema while saving…

pyspark avro spark-avro

asked Mar 23 '21 at 04:46

ASHISH M.G

522
2
7
23

Questions tagged [spark-avro]