featuretools timeseries data group by month year

Asked Jul 02 '18 at 16:00

Active Jul 05 '18 at 19:27

Viewed 253 times

I have timeseries data with application number, loan amount. How do I get group by count of applications and average loan amount using featuretools package without adding a relationship of month year back to main entity?

I have already accomplished this using pandas but I am trying to explore featuretools package and was wondering if it has group by like functionality.

Below the example of pandas version: I want to replicate it using featuretools.

#Creating a copy of the existing data frame
new_df=df[:] 

#Creating values
new_df['year'] = new_df['DATE'].dt.year
new_df['month'] = new_df['DATE'].dt.month

#Sorting Values
new_df=new_df.drop_duplicates().sort_values(by=['var_1','var2','year','month'])

#Counting Distinct variable across 4 variables then taking cummulative sum across 2 variables and storing it in a new data frame
new_df_count_cummulative=new_df.groupby(['var_1','var_2','year','month']).var_3.nunique().groupby(['var_1','var_2']).cumsum()

edited Jul 05 '18 at 19:27

asked Jul 02 '18 at 16:00

Data Violinist

It isn't clear what you are asking. What have you tried so far? – Greg Jul 02 '18 at 16:20
can you share more details about the dataset you are using and you're existing pandas code? – Max Kanter Jul 02 '18 at 16:33
@MaxKanter Just edited the question to include the pandas code. – Data Violinist Jul 05 '18 at 18:58
@Greg Does my updated question add more clarity? – Data Violinist Jul 17 '18 at 21:05
I'm not familiar with the tools, but could it be chunking that you are looking for? https://docs.featuretools.com/guides/chunking.html – Greg Jul 18 '18 at 08:51

featuretools timeseries data group by month year

0 Answers0