I am working on my bachelor thesis and my topic is to benchmark nodejs applications. I have an I/O intensive application where I would need some sample files.
Therefore it would be great if I had a lot of small sized files, some medium files and some big files (>1gb). But they should represent real data (eg. pictures, pdfs, documents, archives, ...)
If you think that this should not be asked on stackoverflow please tell me where i can ask this.
Do you know where I can get such sample datasets?