Does anyone know of some good datasets for text classification in NLP?

The BBC News Dataset worked very well for me, and it is the kind of thing I am looking for: classification of text documents into 3-10 labels.

Sadly, as I want to work with deep learning classification, the dataset is too small.

Can anybody help me out?

  • You can check some datasets here: https://paperswithcode.com/datasets?task=text-classification&mod=texts&page=1 – u1234x1234 Oct 13 '22 at 20:02
  • There are lots of options. Practically any tutorial you find online proposes a dataset. But you need to be more specific: there are many different kinds of classification tasks in NLP, such as spam detection, topic classification, author attribution, etc. – Erwan Oct 14 '22 at 08:56

1 Answer

You can check the repo https://github.com/niderhoff/nlp-datasets to find what you want. You can also search for specific datasets at https://appen.com/pre-labeled-datasets/.

AbnerWang