I would like to collect aggregate usage metrics from a Cloudera 5.4.4 Hadoop cluster. The metrics I have in mind include:
- Average CPU utilization of the cluster per day or per week
- Top n longest running jobs/queries on Hadoop
- Top n users who use the cluster most (by resource utilization or by number of submitted jobs)
- Cluster disk usage vs disk capacity
- Cluster disk usage growth over time
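
To make these concrete, here is a minimal sketch of the kind of aggregation I have in mind, assuming the raw job records and disk samples were already available as plain Python data. The field names (`user`, `runtime_s`), the sample values, and the helper function names are all hypothetical placeholders of mine, not the output or API of any Cloudera or Hadoop tool; getting the real data is exactly the part I am asking about.

```python
from collections import Counter
import heapq

# Hypothetical job records; field names and values are made up for illustration.
jobs = [
    {"id": "job_1", "user": "alice", "runtime_s": 1200},
    {"id": "job_2", "user": "bob", "runtime_s": 300},
    {"id": "job_3", "user": "alice", "runtime_s": 4500},
    {"id": "job_4", "user": "carol", "runtime_s": 60},
]

# Hypothetical daily disk samples: (day, used units, capacity units).
disk_samples = [
    ("2015-08-01", 40, 100),
    ("2015-08-02", 45, 100),
    ("2015-08-03", 55, 100),
]


def top_n_longest_jobs(records, n):
    """Return the n job records with the largest runtime, longest first."""
    return heapq.nlargest(n, records, key=lambda job: job["runtime_s"])


def top_n_users_by_job_count(records, n):
    """Return (user, job_count) pairs for the n users with the most jobs."""
    return Counter(job["user"] for job in records).most_common(n)


def disk_usage_ratio(sample):
    """Return used / capacity for a single disk sample."""
    _, used, capacity = sample
    return used / capacity


def disk_growth(samples):
    """Return (day, change in used space since the previous sample) pairs."""
    return [
        (cur[0], cur[1] - prev[1])
        for prev, cur in zip(samples, samples[1:])
    ]


if __name__ == "__main__":
    print(top_n_longest_jobs(jobs, 2))
    print(top_n_users_by_job_count(jobs, 2))
    print(disk_usage_ratio(disk_samples[-1]))
    print(disk_growth(disk_samples))
```
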
Are there any APIs, resources, or tools that I could use to get started? I am not sure where to begin, so any starting point would be greatly appreciated. Also, if you have experience collecting cluster usage metrics, please share it.
Thanks in advance!