Friends, We are trying work on a problem where we have a dump of only reviews but there is no rating in a .csv file. Each row in .csv is one review given by customer of a particular product, lets a TV.
Here, I wanted to do classification of that text into below pre-defined category given by the domain expert of that products:
- Quality
- Customer
- Support
- Positive Feedback
- Price
- Technology
Some reviews are as below:
- Bought this product recently, feeling a great product in the market.
- Was waiting for this product since long, but disappointed
- The built quality is not that great
- LED screen is picture perfect. Love this product
- Damm! bought this TV 2 months ago, guess what, screen showing a straight line, poor quality LED screen
- This has very complicated options, documentation of this TV is not so user-friendly
- I cannot use my smart device to connect to this TV. Simply does not work
- Customer support is very poor. I don't recommend this
- Works great. Great product
Now, with above 10 reviews by 10 different customers, how do I categorize them into the given buckets (you can call multilabel classification or Named Entity Recognition or Information extraction with sentiment analysis or be it anything)
I tried all NLP word frequency counting related stuff (in R) and referred StanfordNLP (https://nlp.stanford.edu/software/CRF-NER.shtml) and many more. But could not get a concrete solution.
Can anybody please guide me how can we tackle this problem? Thanks !!!