In my usecase getting set of matching filepattern from Kafka,
PCollection<String> filepatterns = p.apply(KafkaIO.read()...);
Here each pattern could match upto 300+ files.
Q1. How can I use TextIO.Read()
to match data from PCollection
, as withHintMatchesManyFiles()
available only for TextIO.Read()
not for TextIO.ReadFiles()
.
Q2. If path via FileIO.Match->FileIO.ReadMatch()->TextIO.ReadFiles() is used, withHintMatchesManyFiles()
isn't available in this path, how it will impact the read performance?
Q3. Is there any other optimized path for above usecase?