I have a large dataset that I would like to plot in an IPython notebook.
I read the ~0.5GB .csv file into a Pandas DataFrame
using read_csv
, this takes about two minutes. Then I try to plot this data.
data = pd.read_csv('large.csv')
output_notebook()
p1 = figure()
p1.circle(data.index, data['myDataset'])
show(p1)
My browser spins and does not show me any plots. I have tried the following:
output_file()
instead ofoutput_notebook()
- Graphing using a
ColumnSource
object as thesource
argument tocircle()
- Downsampling my data to something more manageable.
Bokeh claims on its website to offer "high-performance interactivity over very large or streaming datasets". How do I visualize these large datasets without my computer grinding to a halt?