Recently I need to create a histogram which shows frequency distribution of a large data set. This should be a simple job if the data set is small. However, the data set I need to plot contains about 800000000 numbers (lets assume each number takes 4 bytes), and they are all stored in one text file, one number each line. The text file is about 4 GB. I tried GNUPLOT but it complains not enough memory for handling this data set. Can someone suggest how to solve this problem, or any other tools for doing this job?
Thanks, Tom