I'm working on a script that will eventually allow for data exploration using ipywidgets. I've made some parts work for a dynamic number of columns someone may be interested in filtering on, but implementing the interact function in a dynamic manner is proving to be difficult. Sample code below that I've been running in Jupyter:
import ipywidgets as widgets
from ipywidgets import interact
import pandas as pd
df = pd.read_csv('https://raw.githubusercontent.com/yankev/testing/master/datasets/nycflights.csv')
df = df.drop(df.columns[[0]], axis=1)
filter_cols = list(['origin','dest','carrier']) #list N columns we want to filter on
filter_df = df[filter_cols] #pull selected N columns from dataframe
filter_df.drop_duplicates(inplace=True) #remove duplicates
#loop through columns and create variables/widgets
for idx, val in enumerate(filter_cols):
#creates N variables (filter0, filter1, filter2) with unique values for each column with an All option
globals()['filter{}'.format(idx)] = ['All']+sorted(filter_df[val].unique().tolist())
#creates N widgets (widget0, widget1, widget2) for interact function below
globals()['widget{}'.format(idx)] = widgets.SelectMultiple(
options=globals()['filter{}'.format(idx)],
value=['All'],
description=val,
disabled=False
)
#looking to make this function dynamic based on the number of columns we want to filter by
#filters down source dataframe based on widget value selections
def viewer(a, b, c = list()):
#if widget selection is 'All', pass the full filter list, else filter only to what is selected in the widget
return df[df['origin'].isin(filter0 if a==('All',) else a)
& df['dest'].isin(filter1 if b==('All',) else b)
& df['carrier'].isin(filter2 if c==('All',) else c)].shape[0]
#displays N filters
#returns record count for filter combination
interact(viewer, a=widget0, b=widget1, c=widget2)
The second half of the code, after the loop, is what I'd like to make dynamic. As it stands now, I'd have to change the column name callouts and add/remove code for any additional filters. It would be nice to limit the amount of manipulation to a few points in the script.
Any suggestions are much appreciated. Thanks!