I have a txt file which is tab separated. Few of the columns have date data in the format of
"Dec-2011", "Jan-1994"
etc the date ranges from "Jan-1944 to Dec-2015"
Problem is the in original data the date format is "Jan-1994" or the year is in YYYY format but when i read the data using Spyder (3.7 or 4.0.1) or Jupyter or Notepad++, everywhere the Date is read as
"Jan-94", "Dec-11" etc
I am using the following code:
import pandas as pd
Raw_Data=pd.read_csv("XYZCorp_LendingData.txt", encoding="Latin-1", sep ='\t', low_memory=False)
The Output should be:- this output is in the office system and THIS is the correct format
Raw_Data["issue_d"].head()
Out[5]:
0 Dec-2011
1 Dec-2011
2 Dec-2011
3 Dec-2011
4 Dec-2011
Name: issue_d, dtype: object
BUT
I am getting output as below because somehow my system is reading in YY format instead of YYYY format. this output is in my laptop which is incorrect
Raw_Data["issue_d"].head()
Out[6]:
0 Dec-11
1 Dec-11
2 Dec-11
3 Dec-11
4 Dec-11
Name: issue_d, dtype: object
This date issue is leading to multiple errors
Also this issue is not only related to Spyder but I am facing the issue with other applications also like Notepad++ and Jupyter
**Also note I made sure I cross-validated the Date and Regional settings in both the systems, both are almost same