Questions tagged [featuretools]

Featuretools is a Python library for automated feature engineering on relational datasets using a technique called Deep Feature Synthesis.

Featuretools is an open source python library for automated feature engineering for tabular relational datasets.

Resources

221 questions
1
vote
1 answer

How to know the type of features that the Featuretools generates?

How to know the type of features (numeric, categorial) that Featuretools generates?
deevroman
  • 99
  • 1
  • 2
  • 14
1
vote
1 answer

Can I calculate time between local maxima with featuretools?

I would like to calculate time_since_previous, but not transaction after transaction, instead only between transactions that exceed a maximum value. Can I do that automatically? or do I need to slice the dataframe? More specifically, I have a…
Sole Galli
  • 827
  • 6
  • 21
1
vote
1 answer

where can I find the default returned features by dfs [featuretools]

quick question: Is there some doc / resource where to find the default features output by featuretools dfs? For example if i use trans_primitives=["time_since_previous"] it seems that it outputs time in between transactions and also time from first…
Sole Galli
  • 827
  • 6
  • 21
1
vote
1 answer

How to use interesting values with training window in feature tools?

Code: import pandas as pd import featuretools as ft import featuretools.variable_types as vtypes #Create item details table l = [[1, '1', '2018-05-02', 'A', 2.0, 10], [1, '1', '2018-05-02', 'A', 1.0, 10], [2, '1', '2018-05-28', 'B', 1.0, 40], [3,…
1
vote
0 answers

How is DIFF calculated on customer demographics in featuretools?

I have a two tables of of customer information and transaction info. Customer information includes each person's quality of health (from 0 to 100) e.g. if I extract just the Name and HealthQuality columns: John: 70 Mary: 20 Paul: 40 etc etc. After…
SCool
  • 3,104
  • 4
  • 21
  • 49
1
vote
1 answer

How to Make 'Stacked' Features in Featuretools using DFS

Reading the documentation, adjusting max_depth upward should result in complex 'stacked' features. I'm finding I get no difference in features produced after adjusting max_depth to 2. What am I doing wrong? max_depth = 1: Original…
Windstorm1981
  • 2,564
  • 7
  • 29
  • 57
1
vote
0 answers

I got stuck trying to fetch the previous value based on a criteria

I'm new to FeatureTools library, and I got stuck trying to create two types of features, both are related to fetching previous values. One is the previous value itself for 'QUANTIDADE', 'VALOR_TOTAL' and 'DATA_NOTA', and the other is the time since…
1
vote
1 answer

Pass parameters to aggregation primitives in featuretools

I'm using featuretools dfs function to generate my feature_matrix using time-related aggregation primitives such as TimeSince (https://docs.featuretools.com/api_reference.html#deep-feature-synthesis) feature_matrix, feature_defs =…
Ivan
  • 673
  • 2
  • 8
  • 20
1
vote
1 answer

FeatureTools: Can there be multiple columns in time_index parameters?

I have 2 time paramaters in a dataframe i.e. start_date and end_date and both are time parameters when one creates a entityset from a dataframe. While specifying the time_index, can we specify 2 different columns? I don't want to create a new column…
Ganesh Bhat
  • 295
  • 7
  • 20
1
vote
1 answer

featuretools historical label counts

Background For a CRM project, I have snapshots of opportunities. I've been able to build many features w/ featuretools already, but what I really want is to have historical win counts and rate. In other terms, I'd like to know: For a given…
Anders Swanson
  • 3,637
  • 1
  • 18
  • 43
1
vote
1 answer

featuretools cutoff_time does not work properly

I tried add a cutoff_time dataframe to the dfs function of featuretool so that each row in my dataframe will have a specific cutoff time.But I cannot make the cutoff_time dataframe work as expected The documentation said that the first column of…
Adam Li
  • 13
  • 2
1
vote
1 answer

AttributeError: Parent variable '' is not the index of entity Entity: train Variables:

I am trying to create an entity relationship based on 2 columns which are same in both dataframes but I am getting the error as stated in the title. Searched Internet for the issue but could not find anything es =…
Ganesh Bhat
  • 295
  • 7
  • 20
1
vote
1 answer

Featuretools TypeError: unhashable type: 'Int64Index'

I am trying to create an entity set from 3 dataframes and while doing so I am getting the error: TypeError: unhashable type: 'Int64Index' I have searched the Internet for similar issues but could not find any issues related to datetime types. Please…
Ganesh Bhat
  • 295
  • 7
  • 20
1
vote
1 answer

How to get an item's group mean but exclude the item itself?

How can I use feature tools to get a mean value of the group which the item belongs to, but excludes the item itself? For example, Input: item group value1 I1 C1 1 I2 C2 5 I3 C2 3 I4 C2 …
wenying
  • 13
  • 2
1
vote
1 answer

When is the time_type set to NumericTimeIndex or DatetimeTimeIndex in the entityset?

I have problems calculating feature_matrix using cutoff_times table because of a type mismatch in cutoff times in the cutoff table and the time_type of the entityset. I am trying to understand the predict-next-purchase example using synthetic…