I'm using datasets
library by huggingface to load csv dataset stored locally. The problem is, the dataset is compressed and is stored as a csv.gz
file. Therefore, I'm not able to load it using load_dataset('csv', '<local_file_path>.csv')
method in huggingface.
Asked
Active
Viewed 255 times
0
1 Answers
0
Just decompress it with gunzip:
gunzip -v yourdataset.csv.gz

cronoik
- 15,434
- 3
- 40
- 78
-
So, that means NO? – Mahavir I Apr 03 '22 at 15:25
-
@MahavirI: At the moment the answer is no. Loading a `csv.gz` leads to an error on my machine (the hard disk is filled with GB of cache files even if the file is very small. Smells like a bug.). – cronoik Apr 03 '22 at 16:25