I have some rather large datasets that I'm working with. Essentially, I'm running some of the tools from scikit-learn on memory-mapped numpy arrays as it seems to allow me to work with larger datasets than the memory on my computer would otherwise allow.
I sort of prefer the joblib to do the memory mapping, because you only have to specify the file.
But I can't seem to figure out how to allocate a new empty, say 100 million by 200 numpy array using only joblib without loading all into memory.
Thanks!