I am new to DataBricks and Apache Spark. I would like to use Python for my notebook.
I am using data bricks and I can confirm I have created:
- A Cluster (test) along with a attached python based notebook (test-notebook)
- A Table from JSON File (employee_info)
- A Notebook.
The Sample Table I have is from a JSON which I have uploaded as test file is as below:
{
"FirstName": "John",
"LastName": "Mark",
"MiddleName": "Lewis",
"username": "johnlewis2",
"Id ": "29103-1201-E",
"YearJoined": "2014",
"MonthJoined": "7",
"DayJoined": "23",
"TimeJoined": "14:32",
"Manager": "Larry Hins",
"Position": "Web Developer",
"Building": "Blue Building",
"Floor": "2"
}
If I have uploaded this table, how can I (Assuming I am using Python for the Notebook):
- Access the table
- Duplicate a whole row
- Save the table with a duplicated column as a new table
Thanks.