3

We have to read data from CSV files and map two files with respect to one column and push data to Cloud SQL using Google Cloud Dataflow.

We are able to read data from CSV files but stuck with the next steps. Please provide me information or links regarding the following:

  1. Merging/joining to flat files based on one column or condition with multiple columns
  2. Copying merged pcollection into Сloud SQL database
Cœur
  • 37,241
  • 25
  • 195
  • 267
Ajay K N
  • 83
  • 8
  • Can you please clarify what you mean by "map two files with respect to one column" - do you mean you want to join two CSV files based on the value of a column and then dump the result to Cloud SQL? A short example of input and expected output would be helpful. – jkff Apr 24 '15 at 16:47

1 Answers1

3

Here's some pointers that may be helpful:

jkff
  • 17,623
  • 5
  • 53
  • 85
  • 1
    How do you configure the Cloud SQL instance for access from DataFlow? Do you need to somehow use the proxy? – sthomps Apr 09 '16 at 04:18