I am trying to create a pipeline in StreamSets Data Collector to read data from a Google Cloud Platform bucket and load the data into the same bucket with a different file name.
The data file in the bucket is in JSON form.
I used the Google Cloud Storage origin in StreamSets Data Collector and gave below properties:
- Common Prefix =
gs://<my-bucket-name>/<json-file-name>
- Prefix Pattern =
https://storage.cloud.google.com/<my-bucket-name>/<json-file-name>
Could someone correct or provide any alternative options?