Questions tagged [featuretools]

Featuretools is a Python library for automated feature engineering on relational datasets using a technique called Deep Feature Synthesis.

Featuretools is an open source python library for automated feature engineering for tabular relational datasets.

Resources

221 questions
3
votes
0 answers

Specifying interesting_variables with featuretools does not work

I'm currently working through the feature tools docs using my own data. So far everything worked fine but I got stuck at adding interesting variables. For some reason, I can't make it work and I am not sure why. The example in the doc works just…
3
votes
1 answer

how to use ft.dfs result join to test set?

I know featuretools has ft.calculate_feature_matrix method, but it calculate data use test. I need when I get the feature use train data,and join to test data not use the same feature on test data. for example: train data: id sex score 1 f 100 2 f…
3
votes
1 answer

adding a variable to an entity - featuretools

I'm trying to add a new variable to an entity. I'm trying to add a variable as follows: es['Product'].add_variable("inventory", data=inventory_series) however I'm getting this error: TypeError: 'Series' objects are mutable, thus they cannot be…
3
votes
1 answer

Calculating same features with multiple training windows in Featuretools

Featuretools supports already handling of multiple cutoff times https://docs.featuretools.com/automated_feature_engineering/handling_time.html In [20]: temporal_cutoffs = ft.make_temporal_cutoffs(cutoffs['customer_id'], ....: …
Georg Heiler
  • 16,916
  • 36
  • 162
  • 292
3
votes
1 answer

Featuretools create index from multiple columns

I am trying to create an entity from a dataframe using the entity_from_dataframe function in featuretools. Is there a way to define the index if it comprises of more than one column. I'm unsure if I need a list, tuple or some other data structure.…
lcampos
  • 31
  • 2
3
votes
2 answers

Excluding the current row from feature engineering in Python featuretools

I'm generating historical features for the current row with featuretools. For example, the number of transactions made in the last hour during a session. Package featuretools includes parameter cutoff_time to exclude all rows that come after…
Anton Tarasenko
  • 8,099
  • 11
  • 66
  • 91
2
votes
2 answers

deep feature synthesis depth for transformation primitives | featuretools

I am trying to use the featuretools library to make new features on a simple dataset, however, whenever I try to use a bigger max_depth, nothing happens... Here is my code so far: # imports import featuretools as ft # creating the EntitySet es =…
2
votes
1 answer

Python featuretools difference by data group

I'm trying to use featuretools to calculate time-series functions. Specifically, I'd like to subtract current(x) from previous(x) by a group-key (user_id), but I'm having trouble in adding this kind of relationship in the entityset. df =…
2
votes
1 answer

Featuretools to aggregate on sub-entities preserving cut_off times

I tried using featuretools with timestamps to use past decisions of distributors as a predictive variable. I have only one dataset as input, with a typical binary classification problem (with n rows). There are set of distributors (< It is very…
Paul
  • 45
  • 3
2
votes
1 answer

How to write seed_features that include a conditional statement

I'm trying to write a seed feature that produces reward if place == 1 else 0. place and reward are both ft.variable_types.Numeric: Entity: results Variables: id (dtype: index) place (dtype: numeric) reward (dtype: numeric) I've tried…
Timothy
  • 27
  • 1
  • 5
2
votes
0 answers

Error in install conda-forge featuretools

I have one question in install featuretools by conda-forge. After following command conda install -c conda-forge featuretools I got these errors. UnsatisfiableError: The following specifications were found to be incompatible with a past explicit…
Weijie Sun
  • 194
  • 6
2
votes
1 answer

trans_primitives are not generated to the datetime column

I am creating a featuretools matrix, that is generated out of 5 dataframe entities and one cutoff_time table. When I use the ft.dfs() function, I use both agg_primitives and trans_primitives, however all the primitives in trans_primitives that…
IshayT
  • 21
  • 3
2
votes
1 answer

How do you implement a weighted sum transform primitive in Featuretools?

I'm trying to figure how to implement a weighted cum sum primitive for Featuretools. The weighting shall depend on time_since_last like cum_sum (amount) = sum_{i} exp( -a_{i} ) * amount_{i} where i are rolling 6 Month periods.... above you find…
2
votes
1 answer

list of built-in aggregate and transform primitives

First, I LOVE featuretools. It has made my work so much easier and more efficient. One quick question: I was just looking for a full list of non-custom agg & trans primitives, but couldn't seem to find it. Do I just take the list of methods in the…
Don
  • 857
  • 1
  • 9
  • 19
2
votes
0 answers

Featuretools Deep Feature Synthesis (DFS) extremely high overhead

The execution of both ft.dfs(...) and ft.calculate_feature_matrix(...) on some time series to extract the day month and year from a very small dataframe (<1k rows) takes about 800ms. When I compute no features at all, it still takes about 750ms.…
Philliams
  • 21
  • 3
1
2
3
14 15