Backing up the Databases

Periodically back up the databases that Cloudera Manager uses to store configuration, monitoring, and reporting data. Be sure to back up every database you are using with Cloudera Manager:

  • Cloudera Manager database: This is the most important database to back up. It contains all the information about the services you have configured, their role assignments, all configuration history, commands, users, and running processes. It is relatively small, typically less than 100 MB.
  • Activity Monitor database: Contains information about past activities. In large clusters, this database can become very large.
  • Service Monitor database: Contains monitoring information about daemons. In large clusters, this database can become very large.
  • Report Manager database: Keeps track of disk utilization over time. This database is typically medium-sized.
  • Host Manager database: Contains information about host status. Its size grows with the number of hosts in the cluster, so it is typically large in deployments with many hosts.
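
As a rough illustration of backing up all five databases in one pass, the following Python sketch builds a dump command per database and runs it. It is a minimal sketch under several assumptions, not a prescribed procedure: the database names (scm, amon, smon, rman, hmon), the backup user, and the output directory are hypothetical placeholders, the databases are assumed to live in MySQL or PostgreSQL, and credentials are assumed to come from a client option file such as ~/.my.cnf or ~/.pgpass rather than the command line.

```python
import subprocess
from datetime import date
from pathlib import Path

# Hypothetical database names, listed with the Cloudera Manager database first
# because it is the most important one. Substitute the names from your deployment.
DATABASES = ["scm", "amon", "smon", "rman", "hmon"]


def dump_command(engine, db_name, out_dir, user, day=None):
    """Return the argument list that dumps one database to a dated .sql file."""
    day = day or date.today()
    out_path = Path(out_dir) / f"{db_name}-{day.isoformat()}.sql"
    if engine == "mysql":
        # --single-transaction takes a consistent snapshot of InnoDB tables.
        return ["mysqldump", "--single-transaction",
                f"--user={user}", f"--result-file={out_path}", db_name]
    if engine == "postgresql":
        return ["pg_dump", f"--username={user}", f"--file={out_path}", db_name]
    raise ValueError(f"unsupported engine: {engine}")


def backup_all(engine, out_dir, user, run=subprocess.run):
    """Dump every database in DATABASES, stopping at the first failure."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    for db_name in DATABASES:
        # check=True raises CalledProcessError if the dump tool exits non-zero.
        run(dump_command(engine, db_name, out_dir, user), check=True)
```

A call such as backup_all("mysql", "/var/backups/cm", "backup") could then be scheduled with cron. Because the monitoring databases can become very large, you may want to check available disk space in the output directory before scheduling this.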