I'm trying to load my csv file into datalab. But the csv file is too large to load. Even if I managed to do that, it'll take too long to do the preprocessing.
I'm thinking of using Keras to do ML on this dataset. The questions are:
- How do I use a data generator to feed Keras my raw data?
- What about the data preprocessing, shall I do it in dataprep or dataflow or is it just fine in datalab?
- Is there any way to speed up the training process? Now, I have to leave the datalab window open for a long time for the training to finish. I don't feel comfortable leaving the webpage open for such a long time.
Thanks!