I am extracting the required content from the HTML files stored in a warc.gz file, but I am not sure how many HTML files the .gz archive contains.
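For reference, here is a minimal sketch of the kind of count I am after, using only the Python standard library. It assumes each page is stored as a WARC `response` record whose HTTP `Content-Type` header contains `text/html`, and that every record carries a correct `Content-Length`. The function name `count_html_responses` and the file name `crawl.warc.gz` are hypothetical, and a dedicated WARC library would likely handle edge cases (chunked encoding, malformed records) better than this.

```python
import gzip


def count_html_responses(path):
    """Count WARC 'response' records whose HTTP Content-Type is text/html."""
    count = 0
    # gzip.open reads concatenated gzip members as one continuous stream,
    # which is how .warc.gz files are usually written (one member per record).
    with gzip.open(path, "rb") as stream:
        while True:
            line = stream.readline()
            if not line:
                break  # end of archive
            if not line.startswith(b"WARC/"):
                continue  # blank separator lines between records

            # Read the WARC headers up to the first blank line.
            headers = {}
            while True:
                line = stream.readline()
                if line in (b"\r\n", b"\n", b""):
                    break
                name, _, value = line.partition(b":")
                headers[name.strip().lower()] = value.strip()

            # Read the record block by its declared length, so payload bytes
            # are never mistaken for the start of a new record.
            length = int(headers.get(b"content-length", b"0"))
            block = stream.read(length)

            if headers.get(b"warc-type") != b"response":
                continue  # skip warcinfo, request, metadata, ...

            # The block of a response record starts with the HTTP headers.
            http_head = block.split(b"\r\n\r\n", 1)[0].lower()
            for http_line in http_head.split(b"\r\n")[1:]:
                name, _, value = http_line.partition(b":")
                if name.strip() == b"content-type" and b"text/html" in value:
                    count += 1
                    break
    return count


if __name__ == "__main__":
    import sys

    if len(sys.argv) > 1:
        # e.g. python count_html.py crawl.warc.gz  (hypothetical file name)
        print(count_html_responses(sys.argv[1]))
```

Counting by record type and `Content-Type` rather than by file extension means redirects, images, and request records in the same archive are not counted as HTML pages.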