Questions tagged [featuretools]

Featuretools is a Python library for automated feature engineering on relational datasets using a technique called Deep Feature Synthesis.

Featuretools is an open source python library for automated feature engineering for tabular relational datasets.

Resources

221 questions
1
vote
1 answer

How do you search for particular features?

At last when I tried featuretools I was searching for a particular feature which I was expecting. When you have > 30 feature it is kind of time consuming to find the feature. Has the feature_names object (second return object of the dfs method) a…
1
vote
1 answer

How does df.groupby('A').agg('min') translate to featuretools?

Say I have this simple snippet of code. I will group, aggregate, and merge the dataframe: Using Pandas: Data df = pd.DataFrame({'A': [1, 1, 2, 2], 'B': [1, 2, 3, 4], 'C': [0.3, 0.2, 1.2, -0.5]}) df: A B C 0…
1
vote
1 answer

How do you use MultiplyNumeric for a date transformed to a weight and a value of the same table?

My primary aim is a feature which considers more recent information of higher value. So, the Idea is to calculate via a new primitive transformation "WeightTimeUntil" a weighing factor which afterwards could be used by the transformation primitive…
1
vote
1 answer

Implementing feature store for ML pipline

There is lot of discussion about feature store in ML pipeline.Typical use case is res usability of feature among different scientist working on different model. Is featuretool supports feature store concept?
Vikram Ranabhatt
  • 7,268
  • 15
  • 70
  • 133
1
vote
0 answers

Custom featuretools primitive for category frequency

I have a dataframe and would like to create a custom feature to the frequency of the category. How can I achieve this using a featuretools custom primitive? In pandas, this looks like the following: df = pd.DataFrame({'category': ['a' ,'b',…
alacarter
  • 349
  • 2
  • 11
1
vote
1 answer

Unable to deploy featuretools as EntitySet.read_pickle() method missing

With reference to the deployment methods for featuretools 'v0.7.0' as mentioned on deployment page I have my feature_defs saved using ft.save_features method however upon successfully loading the defs using ft.load_features I found that the next…
Milind Dalvi
  • 826
  • 2
  • 11
  • 20
1
vote
1 answer

Unable to create time-based features in a dataset which has other numerical and categorical variable

I want featuretools to create features based on time index and cutoff time that I have declared in my entity set. I have a dataset with time variables as well as numerical and categorical variable. There is an ITEMID column, each ITEMID has 2 to 12…
Vikrant
  • 139
  • 1
  • 12
1
vote
1 answer

Can feature tools save feature list directly to s3?

I'm trying to persist the list of features returned from Deep Feature Synthesis directly to S3. If persisting locally, I can use "ft.save_features(features,pathtofile)". Is there anyway to pass in an S3 url to this method?
Nick Bernini
  • 121
  • 4
1
vote
1 answer

Including only certain features when running deep feature synthesis?

For example one of my entities has two sets of IDs. One that is continuous (which apparently is necessary to create the EntitySet), and one to use as a foreign key when merging with my other table. This results in featuretools including the ID in…
selib
  • 39
  • 2
1
vote
0 answers

Installing featuretools library in google datalab gives error

I am trying to install featuretools library in Google datalabs. Tried with few versions but still it is giving error of incompatibility. Also tried in python 2 and python 3 both. !pip install -I featuretools==0.5.1 Tried 0.6.1, 0.6.0 and 0.5.1 but…
Akash
  • 359
  • 1
  • 7
  • 27
1
vote
1 answer

Feature Tools default cutoff_time

If I didn't specify the cutoff_time which date it will be the reference for the primitive TIME_SINCE_LAST? I cannot find it in the documentation. Thanks!
Rui Figueiredo
  • 110
  • 1
  • 5
1
vote
1 answer

Automated feature generation for time series problems - Featuretools

I'm trying to use featuretools to generate features to help me predict the number of museum visits next month. Can featuretools generate features for time series? Should I changed the data so that the id is the month or can featuretools do it…
Shahad
  • 11
  • 2
1
vote
1 answer

Are there built-in primitives for interactions in Feature tools?

are there built-in primitives performing absolute and relative differences between two numeric columns? Two date columns?
Sergey Skripko
  • 336
  • 1
  • 8
1
vote
1 answer

How to fix featurtools Type Error on colab

As soon as I enter time_index='date' parameter in es.entity_from_dataframe(..) in the code below, ft.dfs(..) throws a long list of errors starting from Type Error. I am using google colab with featuretools version 0.4.1. import pandas as pd import…
1
vote
1 answer

Using Featuretools to aggregate per time time of day

I'm wondering if there's any way to calculate all the same variables I already am using deep feature synthesis (ie counts, sums, mean, etc) for different time segments within a day? I.e. count of morning events (hours 0-12) as a separate variable…