I am trying to read a csv file into a dataframe using an encoder but am running into some issues. The file has the following format:
While it should be taking the first entry and making the respective entry:
92,61,2008-08-01T14:45:37Z,90,13,"http://svnbook.red-bean.com/"">Version Control with SubversionA very good resource for source control in general. Not really TortoiseSVN specific, though.
"
It is missing the entire second paragraph. The following is what I am doing to parse the csv:
case class tit(Id:Int,OwnerUserId:Int,CreationDate:String,ParentID:Int,Score:Int,Body:String)
val schema=Encoders.product[tit].schema
val df=spark.read.schema(schema).csv(fileName)