Is there anyway to write Dataset object to ORC file? I know a Dataset object can be written as avro file by using AvroOutputFormat
, but looks like there is no equivalent class for orc?
If that can not be achieved, is there any way to convert Dataset to Table or DataStream?
The reason I am asking is that I have to use Dataset API since it supports reading multiple files. Like this
AvroInputFormat<MyType> avroInputFormat = new AvroInputFormat<>(....
avroInputFormat.setFilePaths(<file paths list>)
DataSet<MyType> dataset = env.createInput(avroInputFormat);
this will work. However, if I use DataStream API it will throw exception as
Caused by: java.lang.IllegalArgumentException: FileInputFormats with multiple paths are not supported yet.
Any suggestions will be greatly appreciated. Thanks!