I have installed CDH 5.5.1
with Hue
, Hadoop
, Spark
, Hive
, Oozie
, Yarn
and ZooKeeper
.
When I run a Spark
job or MapReduce
job, Hue
displays a issue in the job history. The problem is that when I restart the CDH
services (Not the physical nodes), it removes all the job histories that were before the restart.
On Hadoop there are several files that I suspect have information about the task and might be the ones that hold the job information. Their hadoop paths are:
/tmp/logs/user/logs/
/user/history/done/2016/
I have looked for it in the Cloudera Manager
configuration page, Hue configuration page and some configuration files with no success. I don't know how to prevent this removal. Am I missing something?