My dataset consists of video game titles from various websites, formatted in different ways. Here's my example:
"The Legend Of Zelda: Wind Waker, Nintendo"
"The Legend Of Zelda: The Wind Waker"
"The Legend Of Zelda: Wind Waker, Nintendo"
"The Legend Of Zelda: Wind Waker, Nintendo"
"Zelda: Wind Waker Hd Nintendo Wii U Game"
"The Legend Of Zelda: The Wind Waker"
"Legend Of Zelda: The Wind Waker Hd (nintendo Wii"
"The Legend Of Zelda: Wind Waker Of Game (nintendo"
"The Legend Of Zelda: The Wind Waker Nintendo Wii"
"Nintendo Wii U Game Zelda: Wind Waker Hd"
"The Legend Of Zelda: The Wind Waker Hd Wii U"
"The Legend Of Zelda: Wind Waker, Nintendo Pinterest"
"Zelda: Hd (nintendo Wii The"
"The Legend Of Zelda: The Wind Waker Hd Wii U Pinterest"
"The Legend Of Zelda: The Wind Waker Hd"
"Legend Of Zelda: Wind Waker Hd (nintendo Wii"
"The Legend Of Zelda: The Wind Waker Hd"
"The Legend Of Zelda: Wind Waker, Nintendo Wii U"
"The Legend Of Zelda Wind Hd"
"Zelda Wind Waker Hd"
"The Legend Of Zelda: Wind Waker, Nintendo Pinterest"
"The Legend Of Zelda Wind Waker Wii U Nintendo"
"Wii U The Legend Of Zelda: The Wind Waker Hd"
"Zelda: Wind Waker Hd"
"The Legend Of Zelda: The Wind Waker Hd Game Wii"
"The Legend Of Zelda: The Wind Waker Hd Nintendo Wii U"
"Zelda: Wind Waker Hd"
"The Legend Of Zelda The Wind Waker Hd Wii U"
The correct output for this data would be:
The Legend Of Zelda: The Wind Waker HD
- Title
Wii U
- Platform
Nintendo
- Publisher
I can feed a model 100's of these datasets, with what I would then expect as the correct output, and then hope that the model "learns" for future datasets of titles what an expected output might be.
Is this something that Machine Learning can do? What model should I use? I have never done anything with ML before so I'm unsure if this is a good use case for it.