Questions tagged [featuretools]

Featuretools is a Python library for automated feature engineering on relational datasets using a technique called Deep Feature Synthesis.

Featuretools is an open source python library for automated feature engineering for tabular relational datasets.

Resources

221 questions
0
votes
0 answers

Unable to establish relationship in featuretools

I am having some problems when using this: Several of my entities have "id" and "time", and of course there are some other records. Since it records the activity data for each id every day, my id is not unique in a table. I want to establish a…
罗文浩
  • 79
  • 1
  • 1
  • 8
0
votes
1 answer

Featuretools - LookupError: Time index not found in dataframe

I have a input dataframe which I have split up into 3 entities based on the attributes. When I try to generate features using featuretools I get the above mentioned error input dataframe in_df = ['UniqueID', 'disbursed_amount', 'asset_cost', 'ltv',…
Ganesh Bhat
  • 295
  • 7
  • 20
0
votes
1 answer

Avoid duplication of date column for child entity

I have a simple entity set parent1 <- child -> parent2 and a need to use a cutoff dataframe. My target is the parent1 and it's accessible at any time of predictions. I want to specify a date column only for the parent2 so that this time information…
Sergey Skripko
  • 336
  • 1
  • 8
0
votes
1 answer

How do I interpret this particular feature

lets say I have 3 entities: parent1 <- child -> parent2. I used dfs() and got feature I can't understand MEAN(child.parent2.MEAN(child.num_feature)). Reading documentation I thought about any_entity.MEAN features as "group by entity then apply MEAN"…
Sergey Skripko
  • 336
  • 1
  • 8
0
votes
1 answer

prediction and time series

how to decide how in advance my prediction is? i am following the featuretools churn tutorial https://github.com/Featuretools/predict-customer-churn what i don't quite understand how did it decide that the prediction is for one month in advance..…
0
votes
1 answer

AttributeError: module 'utils' has no attribute 'make_labels' in featuretools in Python

I am trying to follow the Featuretools tutorial named: Predicting a customer's next purchase using automated feature engineering For more information on Featuretools look here: https://docs.featuretools.com/index.html The data can be obtained here:…
user8270077
  • 4,621
  • 17
  • 75
  • 140
0
votes
1 answer

AttributeError: 'EntitySet' object has no attribute 'plot' in featuretools

I am getting this exception when I try to run the command: es.plot() AttributeError: 'EntitySet' object has no attribute 'plot' The es object is an 'EntitySet' class object in featuretools. I created es using the following commands: es =…
user8270077
  • 4,621
  • 17
  • 75
  • 140
0
votes
2 answers

LookupError: Time index not found in dataframe

Here is the code to reproduce this issue, but it can be avoided by removing "orders" entity. import featuretools as ft import pandas as pd import numpy as np df = pd.DataFrame({'member_id': ['AAA', 'AAA', 'AAA', 'AAA', 'AAA', 'JJJ', 'JJJ',…
alan
  • 13
  • 1
0
votes
1 answer

What does this even mean SUM(sessions.MIN(transactions.amount)) from the official featuretools documentation?

Can someone help me with how such a feature created automatically by Deep Feature Synthesis will actually work? I came across so many such features created and I'm not sure of how it works. I am unable to hook up the whole code here, although I am…
0
votes
1 answer

How to retain feature columns after running dfs on entity?

I tried Featuretools example mentioned at following URL: https://docs.featuretools.com/index.html Customers dataframe has following data: In [4]: customers_df Out[4]: customer_id zip_code join_date date_of_birth 0 1 60091…
enjoystar
  • 5
  • 4
0
votes
1 answer

Can Featuretools select a set of features from sets of features?

I have a dataset with 100+ inputs, and 6 inputs can be combined into a set, they collected from the same sensor. I same trying to find which sensor's data are the best features. Can Featuretools do feature engineering and find the best set of…
drerD
  • 629
  • 1
  • 9
  • 24
0
votes
1 answer

Unable to add relationship because dtypes don't match in Featuretools

The following error arises when trying to add a relationship between two entities in Featuretools Unable to add relationship because ID in metadata is Pandas `dtype category` and ID in transactions is Pandas `dtype category` Note, the Series are…
Georg Heiler
  • 16,916
  • 36
  • 162
  • 292
0
votes
1 answer

Featuretools relationship with non unique join key

Assuming I have two tables, one with metadata about a customer with field customer_id and an events table recorded from website clickstream events with fields customer_id, date. Obviously, the second table might have several non unique events…
Georg Heiler
  • 16,916
  • 36
  • 162
  • 292
0
votes
1 answer

Featuretools handling of multiple join keys

How are one to many relationships with multiple join keys represented in featuretools? Is there some integrated approach - or should the join keys manually be concatenated into a single column?
Georg Heiler
  • 16,916
  • 36
  • 162
  • 292
0
votes
1 answer

Understanding the PercentTrue primitive output in featuretools

I've been playing with the predict-appointment-noshow notebook tutorial and I'm confused by the output of the PERCENT_TRUE primitive. My understanding is that after feature generation, a column like locations.PERCENT_TRUE(appointments.sms_received)…
ketone
  • 23
  • 4
1 2 3
14
15