I would like to use the Kafka Connect S3 sink connector to stream data out of a topic into an S3 bucket. The data in the topic will be XML messages. According to the connector config, we can define the format of the output (for example, JsonFormat).
According to the Confluent docs, it looks like we can define a custom format by implementing
io.confluent.connect.storage.format.Format.
I was going through the code of the available formats such as JsonFormat, and it looks like the actual format logic lives in JsonRecordWriterProvider, which is an implementation of
io.confluent.connect.storage.format.RecordWriterProvider
I see that its RecordWriter write implementation runs SinkRecord.value() through JsonConverter.
How can we know what a SinkRecord contains, and can we just write an XML converter that converts SinkRecord.value() to a DOM object or similar?
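To make the question concrete, here is a rough sketch of the conversion I have in mind. It uses only the JDK XML parser. XmlValueSketch and toDom are hypothetical names of my own, not part of the connector. It assumes the record value arrives as a String or a byte[], which I believe depends on the value converter configured for the connector (for example StringConverter or ByteArrayConverter).

```java
import java.io.ByteArrayInputStream;
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical helper (not a connector class): turns a record value into a DOM Document.
final class XmlValueSketch {

    // Assumption: value is whatever SinkRecord.value() returned, either a String or a byte[].
    static Document toDom(Object value) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        // Reject DOCTYPE declarations so untrusted messages cannot pull in external entities.
        factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
        DocumentBuilder builder = factory.newDocumentBuilder();

        if (value instanceof String) {
            // Already decoded text: parse from a character stream.
            return builder.parse(new InputSource(new StringReader((String) value)));
        }
        if (value instanceof byte[]) {
            // Raw bytes: let the parser detect the encoding from the XML declaration.
            return builder.parse(new ByteArrayInputStream((byte[]) value));
        }
        String type = (value == null) ? "null" : value.getClass().getName();
        throw new IllegalArgumentException("Unsupported value type: " + type);
    }
}
```

My idea would be to call something like XmlValueSketch.toDom(record.value()) from the write method of a custom RecordWriter, but I am not sure whether the value really has one of these types in practice.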
Are there any references that I could use to implement one?
So far I have only been going through the code provided by Confluent.