Questions tagged [featuretools]

Featuretools is a Python library for automated feature engineering on relational datasets using a technique called Deep Feature Synthesis.

Featuretools is an open source python library for automated feature engineering for tabular relational datasets.

Resources

221 questions
0
votes
1 answer

Why is featuretools creating features from my dataframe index?

I've noticed that featuretools created features from my dataframe index. For example: 'LAST(transactions.payment_id)' This is the index I set when creating the entity: es = es.entity_from_dataframe(entity_id = 'transactions', …
SCool
  • 3,104
  • 4
  • 21
  • 49
0
votes
1 answer

FeatureTools GroupBy issue excluding entities

This question is a follow-up of this post: I could solve the first part of the doubt but after that, another arose. I have the following Featuretools entity set: And I would like to get the groupby_trans_primitives: Diff and…
0
votes
1 answer

Using cutoff_times in featuretools for prediction

I am building a model to predict if a user will purchase a subscription based on his/her read history, etc. (activity). I am using featuretools (https://www.featuretools.com/) to automate feature engineering and this is where it gets tricky: How…
0
votes
0 answers

What am I doing wrong? featuretools' performance estimation seems to be abnormal

I was just trying to run the Featuretools demo on my Windows 10 machine with 64GB of RAM However, it says that the feature generation will take 800 hours! Is this normal? See issue I have raised in…
xiaodai
  • 14,889
  • 18
  • 76
  • 140
0
votes
0 answers

FeatureTools: How to use featuretools where the relationship between datasets is many-to-many?

I have two dataframes A and B. A has an id user_id that is the unique id of the user and each row in this dataframe represents whether or not the user made a purchase. This has a timestamp for each instance of userid appearing in it. each user_id…
figs_and_nuts
  • 4,870
  • 2
  • 31
  • 56
0
votes
0 answers

How to write seed_features that work with cutoff_time

I am trying to count the number of first place finishes each runner has had in the past as they compete in their current race, however, a ValueError occurs when I run the code. I'm loading a series of races and runners into Featuretools. Entity:…
Timothy
  • 27
  • 1
  • 5
0
votes
1 answer

KeyError: "labels ['Adj Close'] not contained in axis" using FeatureSelector

I'm at a loss as to what's happening here. I'm downloading historical stock data with Pandas Datareader, and after some small manipulations (ie. re-arranging the dataframe, adding moving averages, etc.), I pass the dataframe to FeatureTools to do a…
wildcat89
  • 1,159
  • 16
  • 47
0
votes
2 answers

Create custom primitive function of a list type using custom variable types

I have a question about featuretools's make_agg_premitives function. In my data there are values that consist of list format. For example, id products a ['a', 'b', 'c'] b ['a','c'] a ['a','c'] I want…
HHK Mmk
  • 37
  • 3
0
votes
0 answers

How to fix "AttributeError: Name of the index variable in the target entity or "instance_id" must be present in cutoff_time" error in featuretools?

I have transactions and customers entities and my label and cutoff_time columns are in transactions entity where my target entity is customers. customers and transactions entities have relationship. How can I point my cutoff_time from customers…
0
votes
1 answer

How can you calculate featurematrix in featuretools more specific to avoid long running?

I want to calculate a feature of second order (depth = 2). Because of the entity structure the feature matrix calculation need to calculate so many combination that the calculation takes "years". Can one more specify via a rule settings the list of…
0
votes
1 answer

where can I find make_labels?

I do not find make_labels I thought it would be part of the independent package utils. But I guess it was part of featuretools. it featuretools.utils you just have make_temporal_cutoffs instead. So how do you use that? Waht would be the translation…
0
votes
1 answer

How do you define a custom primitive with parameters using the Featuretools package?

I'm trying to create a custom transformation using the Featuretools package where I can input a parameter and change the behaviour of the the function For example for the following custom log transformation class I wish to add a base parameter so I…
0
votes
2 answers

FeatureTools TypeError: unhashable type: 'set'

I'm trying this code for featuretools: features, feature_names = ft.dfs(entityset = es, target_entity = 'demo', agg_primitives = ['count', 'max', 'time_since_first', 'median', 'time_since_last', 'avg_time_between', …
Aainaa
  • 11
  • 2
0
votes
1 answer

How do you detect or control dangerous usage of variables and their transformation using featuretools?

When you apply the transformation year, month, or day on a date of a purchase you could very easily run into a problem! Imagine your purchase is in 2018 but you want to apply a model on data of 2019. the Model is developed using features…
0
votes
1 answer

Change units for Time Since Last Primitive in Featuretools

When using the time_since_last primitive, how do I change the units from seconds (the default) to days? I see in the documentation TimeSinceLast objec accepts a units param, but I can’t see an easy way to pass it when using dfs or…
Max Kanter
  • 2,006
  • 6
  • 16