Questions tagged [featuretools]

Featuretools is an open-source Python library for automated feature engineering on tabular, relational datasets, using a technique called Deep Feature Synthesis.

221 questions
1 vote, 1 answer

Is it possible to skip the processing of one column?

I want to keep one column of my dataframe in its original state, without applying any primitive to it. Is this possible?
Rui Figueiredo
1 vote, 0 answers

Using featuretools for text data (word count, tfidf)

Featuretools works best with relational categorical and numerical data. For text, it seems to compute only the text length and a few other very basic statistics. What would be the best pipeline for preparing textual data for featuretools? Should it be…
Aidis
1 vote, 1 answer

How to select features from feature_def created through deep feature synthesis

I am using deep feature synthesis to create new features. How can I select features from feature_def? For example, I need to select all the features whose names contain the string "Age". I tried the following code, which gave me the error "argument of type…
Abin John Thomas
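
One possible approach to the question above, as a minimal sketch only: filter the list of feature definitions by testing the substring against each feature's name rather than against the feature object itself. The sketch assumes each feature definition exposes a `get_name()` method; to stay self-contained it uses a hypothetical `StandInFeature` class instead of real featuretools objects, and the feature names are made up for illustration.

```python
class StandInFeature:
    """Hypothetical stand-in for a feature definition (not a featuretools class).

    Only the get_name() accessor is mimicked here.
    """

    def __init__(self, name):
        self._name = name

    def get_name(self):
        return self._name


def select_by_substring(feature_defs, substring):
    """Return the feature definitions whose name contains `substring`."""
    # Compare against the name string, not the feature object.
    return [f for f in feature_defs if substring in f.get_name()]


# Made-up feature names, for illustration only.
feature_defs = [
    StandInFeature("Age"),
    StandInFeature("MEAN(visits.Age)"),
    StandInFeature("COUNT(visits)"),
]

selected = select_by_substring(feature_defs, "Age")
selected_names = [f.get_name() for f in selected]
print(selected_names)
```

If the real feature list behaves the same way, the resulting names could then be used to pick the matching columns out of the feature matrix.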
1 vote, 1 answer

What does "log" in feature_matrix[["MEAN_SUNDAY(log.value, datetime)", "MEAN_SUNDAY(log.value_2, datetime)"]] from featuretools mean?

I am unsure which of the following is true of log.value and log.value_2: 1) The logarithms of the columns "value" and "value_2", along with "datetime", are passed as two inputs to MEAN_SUNDAY to perform the user-defined function. 2) "log" is just an entity from the entityset which…
1 vote, 0 answers

Error while installing featuretools on Mac

System specification: macOS Mojave, version 10.14.1 (18B75). I am trying to install featuretools on a Mac using pip. Featuretools is a Python library used to generate a new set of features from existing features. Command used: python -m pip…
shre2306
1 vote, 0 answers

Error "group() got an unexpected keyword argument 'observed'" when using the demo

I tried the demo. When I ran feature_matrix, features_defs = ft.dfs(entityset=es, target_entity="customers"), I got this error message: group() got an unexpected keyword argument 'observed'. Looking forward to your help. Thanks.
Xu Zhang
1 vote, 1 answer

Features are not being generated for my EntitySet setup in featuretools

I encountered a problem when I tried to create relationships between entitysets (using my own data). There is no error, but it just doesn't create features for one of my entities (the "prods" entity), although everything should be connected just…
1 vote, 1 answer

How to record constants derived by FeatureTools when using Deep Feature Synthesis

When FeatureTools performs deep feature synthesis, is there a way for it to record constant values it has derived? For example, I have a table with many rows like this:

| loan_id | loan_term |
|---------|:---------:|
| a | 12 |
| ... |…
alacarter
1 vote, 1 answer

Featuretools with a single table and the Min primitive gives an error

My environment is:
Operating system version: Windows-10-10.0.17134-SP0
Python version: 3.6.5
pandas version: 0.23.0
numpy version: 1.14.3
Featuretools: 0.3.0
and my pandas dataframe looks…
user274610
1 vote, 0 answers

Can you merge entities in featuretools?

Is it possible to merge entities in an entity set, the way dataframes can be merged in pandas? Something that is the reverse of normalizing entities. If I have the following es: Entityset: transactions Entities: sessions [Rows: 35, Columns: 6] products…
Sarah
1 vote, 1 answer

Error in featuretools entity set

I got the following error message when I defined an entity set with featuretools: AttributeError: 'str' object has no attribute 'copy'. Here's my code: import featuretools.variable_types as vtype es = ft.EntitySet(id="Policy") es =…
1 vote, 1 answer

es.normalize_entity error: variable not found in entity

I am using the featuretools documentation to learn about EntitySet and am currently getting the error KeyError: 'Variable: device not found in entity' for the following piece of code: import featuretools as ft data = ft.demo.load_mock_customer() customers_df =…
S Belvi
1 vote, 1 answer

Using featuretools on lagged features on a single table

The paradigm I usually see suggested for using featuretools to create aggregations is to have separate entities linked by a relationship. My case is a bit different: I have a single table that looks like this: user_id | time_id |…
giobatta912
1 vote, 0 answers

How to use featuretools get_pandas_data_slice

Can someone show an example or give some more explanation around how to use featuretools.EntitySet.get_pandas_data_slice()? I've looked at the API documentation but am unclear on the parameters filter_entity_ids, index_eid, and…
Nick Bernini
1 vote, 1 answer

How do I prevent data leakage with featuretools?

I love featuretools, but I'm having a hard time working it into my data science workflow because I'm concerned about data leakage. I think that the way to prevent this would be to run deep feature synthesis on the training set, then join the…