If the schema of my incoming CSV file changes, how can I handle this in Spark?
Suppose that on Day 1
I receive a CSV file with the following schema and data:
FirstName LastName Age
Sagar Patro 26
Akash Nayak 22
Amar Kumar 18
And on…
I am trying to read a TSV file in PySpark using the spark-csv package. My Spark version is 1.6.3. In my dataset, two columns have the same name. I am using the following code to read the data.
temp=sqlContext.read.load(data_file,
…
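One possible way to deal with the repeated column name is to make the header names unique before they are attached to the data. The following is only a minimal sketch in plain Python, not the asker's code: the helper `dedupe_column_names` and the sample header are hypothetical, and the suffix scheme (`name_2`, `name_3`, ...) is an arbitrary choice that assumes no existing column already uses such a name.

```python
from collections import Counter


def dedupe_column_names(names):
    """Return a new list in which repeated names get a numeric suffix."""
    seen = Counter()
    result = []
    for name in names:
        seen[name] += 1
        # The first occurrence keeps its name; later ones become name_2, name_3, ...
        if seen[name] == 1:
            result.append(name)
        else:
            result.append("{}_{}".format(name, seen[name]))
    return result


# Hypothetical header line from a tab-separated file with a repeated column.
header = "id\tname\tvalue\tname".split("\t")
unique_names = dedupe_column_names(header)
print(unique_names)
```

Assuming the file is loaded without treating the first row as a header, the resulting list could then be applied with `df.toDF(*unique_names)`, after which the original header row would still need to be filtered out of the data.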
I have data similar to this:
no,full_name,Ocupation,DOB
1,Eugene Perry,Physical Therapy Assistant,11/22/1972
2,Scott Evans,Account Executive,8/25/1961
3,Harold Porter,Community Outreach Specialist,4/5/1986
4,Roger Turner,Account…