I'm currently making a machine learning model for a student project, and I'm still deciding what model I should use. Here's the brief I was given:
Global Terrorism Database (GTD) is an open-source database including information on terrorist events around the world from 1970 through 2014. Some portion of the attacks have not been attributed to a particular terrorist group. Use attack type, weapons used, description of the attack, etc. to build a model that can predict what group may have been responsible for an incident.
The data frame has:
- 134 columns, about 100,000 rows
- many of the columns have missing values
- I've only been given 5 days to submit my final work, so I can't spend a prolonged period training the model
I'm leaning towards using a backpropogation neural network, as I believe it can handle the missing values, though a random forest might also be viable given the limited amount of time I have to train it. I've done a lot of research on the various pros and cons of common ML models, but any additional advise would be greatly appreciated.