Questions tagged [avroio]

2 questions
0
votes
1 answer

Apache beam reading Avro files from GCS and writing to BigQuery

Running a java job to read Avro files and have been getting errors. Looking for help on this - Here is the code - // Get Avro Schema String schemaJson = getSchema(options.getAvroSchema()); Schema schema = new Schema.Parser().parse(schemaJson); //…
0
votes
1 answer

Apache Beam AvroIO read large file OOM

Problem: I am writing an Apache Beam pipeline to convert Avro file to Parquet file (with Spark runner). Everything works well until I start to convert large size Avro file (15G). The code used to read Avro file to create PColletion: …
fuyi
  • 2,573
  • 4
  • 23
  • 46