I am building a college project that analyzes stock data with Hadoop. I have created the cluster on Google Cloud using Cloudera Manager, but I am a complete beginner. From what I have researched, I need an API to obtain the data, which I would then process with MapReduce code or some other method. However, I don't know how to use that API to import the data into the Hadoop cluster.
Link to the API I'm trying to use: https://www.alphavantage.co/
A Python wrapper class: https://github.com/RomelTorres/alpha_vantage (I don't know how to use it)
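For concreteness, here is a rough sketch of what the ingestion step might look like. It is an illustration under assumptions, not a known-good solution. It assumes the Alpha Vantage `TIME_SERIES_DAILY` endpoint with `datatype=csv`, a placeholder API key, and that the script runs on a cluster node where the `hdfs` command-line tool is on the PATH. The function names and the HDFS directory `/user/stocks` are hypothetical.

```python
import subprocess
import urllib.parse
import urllib.request

BASE_URL = "https://www.alphavantage.co/query"


def build_url(symbol, api_key):
    """Build the request URL for daily prices in CSV form."""
    params = {
        "function": "TIME_SERIES_DAILY",
        "symbol": symbol,
        "apikey": api_key,
        "datatype": "csv",
    }
    return BASE_URL + "?" + urllib.parse.urlencode(params)


def download_csv(symbol, api_key, local_path):
    """Fetch the CSV over HTTP and save it to a local file."""
    with urllib.request.urlopen(build_url(symbol, api_key)) as resp:
        data = resp.read()
    with open(local_path, "wb") as f:
        f.write(data)
    return local_path


def hdfs_put_command(local_path, hdfs_dir):
    """Command that copies a local file into HDFS (-f overwrites)."""
    return ["hdfs", "dfs", "-put", "-f", local_path, hdfs_dir]


def ingest(symbol, api_key, hdfs_dir="/user/stocks"):
    """Download one symbol and push the file into HDFS.

    hdfs_dir is a hypothetical path and must already exist in HDFS.
    """
    local_path = f"{symbol}_daily.csv"
    download_csv(symbol, api_key, local_path)
    subprocess.run(hdfs_put_command(local_path, hdfs_dir), check=True)
    return local_path


# Example (needs network access and a real key):
# ingest("IBM", "YOUR_API_KEY")
```

Once the file is in HDFS, a MapReduce job (or Hive, Spark, and so on) could read it from that directory. The wrapper library linked above could presumably replace the `download_csv` step, but that is not shown here.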
Could someone please explain the basic steps involved in obtaining the data? I am open to a different API or any other suggestions. Any help is highly appreciated.