Sending Usage and Diagnostic Data to Cloudera

With Cloudera Manager you can take regularly-scheduled snapshots of the state of your cluster and automatically send it anonymously to Cloudera. This helps Cloudera improve and optimize Cloudera Manager. If you are a Cloudera Enterprise user, you can also trigger the collection of diagnostic data and send it to Cloudera Support to aid in resolving a problem you may be having. The following sections provide more information about these features.

Configuring Anonymous Usage Data Collection

You can configure Cloudera Manager to send anonymous usage information using Google Analytics to Cloudera. The information helps Cloudera improve Cloudera Manager.

To configure anonymous usage data collection:

  1. From the Administration tab, select Settings.
  2. Under the Other category, set the Allow Usage Data Collection option to enable or disable anonymous usage data collection.
  3. Click Save Changes.

Sending Diagnostic Data to Cloudera

To help with solving problems when using Cloudera Manager on your cluster, you can collect diagnostic data on a regular schedule, and have it automatically sent to Cloudera. Cloudera Manager is configured by default to collect data weekly and to send it automatically. You can schedule the frequency of data collection on a daily, weekly, or monthly schedule, or disable the scheduled collection of data entirely. Separately you can disable the automatic sending of data to Cloudera — see Disabling the Automatic Sending of Diagnostic Data. You can also send a collected data set manually.

If you are a Cloudera Enterprise customer, you can also trigger a collection and send the resulting diagnostic data to Cloudera Support on demand to aid in diagnosing problems you may be having.

  Note:

To automatically send diagnostic data requires the Cloudera Manager Server host to have Internet access, and be configured for sending data automatically. If your Cloudera Manager server does not have Internet access, you can manually send the diagnostic data as described below.

This section covers the following topics:

Configuring the Frequency of Diagnostic Data Collection

By default, Cloudera Manager collects diagnostic data on a weekly basis, and automatically sends it to Cloudera. You can change the frequency to daily, weekly, monthly, or never. If you are a Cloudera Enterprise customer and you set the schedule to Never you can still collect and send data to Cloudera on demand.

To change the frequency of diagnostic data collection:

  1. From the Administration tab, select Settings.
  2. Under the Support category, click in the field for the property Send diagnostic Data to Cloudera Automatically and select the frequency you want.
  3. You can change the day and time of day that the collection will be performed.
  4. Click Save Changes

You can see the setting for the current data collection frequency under the Support menu in the main navigation bar.

Collecting and Sending Diagnostic Data to Cloudera on Demand

  Note:

This feature is available only with Cloudera Enterprise.

The feature described in this section is not available in Cloudera Manager with Cloudera Standard.

If you have been using the Cloudera Enterprise Trial Edition, this feature will no longer be available after your trial license expires.

To obtain a license for Cloudera Enterprise, please contact sales@cloudera.com. When you install your Enterprise license, this feature will be enabled.

As a Cloudera Enterprise customer, you can have Cloudera Manager collect a set of diagnostic If you do not want data sent automatically, you must disable that feature (see Disabling the Automatic Sending of Diagnostic Data).

To collect and send diagnostic data to Cloudera:

  1. Click the Support menu link.
  2. Choose Send Diagnostic Data. This opens the Send Diagnostic Data form. Note that at the top of the form it tells you whether Cloudera Manager is configured to send the data automatically or not. See the instructions below to change this.
  3. Fill in or change the information here as appropriate.
    • To change the System Identifier, from the Administration tab select Settings and go to theOther category.
    • Cloudera Manager pre-populates the End Time based on the setting of the Time Range Selector. You should change this to be a few minutes after you observed the problem or condition that you are trying to capture. Note that the time range is based on the time zone of the host where Cloudera Manager server is running.
    • If you have a support ticket open with Cloudera support, please include the support ticket number in the field provided.
  4. Click Collect Diagnostic Data. A Running Commands window shows you the progress of the data collection steps. When these steps are complete, the collected data is sent to Cloudera.

Disabling the Automatic Sending of Diagnostic Data

If you do not want data sent to Cloudera automatically, you can disable this feature. The data you collect will be saved

To disable sending diagnostic data to Cloudera automatically:

  1. From the Administration tab, select Settings.
  2. Under the Support category, uncheck the box for Send diagnostic Data to Cloudera Automatically.
  3. Click Save Changes

Manually Sending Diagnostic Data to Cloudera

  Note:

This feature is available only with Cloudera Enterprise.

The feature described in this section is not available in Cloudera Manager with Cloudera Standard.

If you have been using the Cloudera Enterprise Trial Edition, this feature will no longer be available after your trial license expires.

To obtain a license for Cloudera Enterprise, please contact sales@cloudera.com. When you install your Enterprise license, this feature will be enabled.

  Note:

Automatically sending diagnostic data may fail sometimes and return an error message of "Could not send data to Cloudera." To work around this issue, you can manually send the data to Cloudera Support as described below.

To manually send collected diagnostic data to Cloudera:

  1. Click the Support menu link.
  2. Choose Send Diagnostic Data. This opens the Send Diagnostic Data form. Note that at the top of the form it tells you whether Cloudera Manager is configured to send the data automatically or not. See the instructions above to change this.
  3. Fill in or change the information in the form as appropriate. Cloudera Manager pre-populates the start and end times, but you can change them. If you have a support ticket open with Cloudera support, please include the support ticket number in the field provided.
  4. Click Collect Diagnostic Data. A Command Details window shows you the progress of the data collection steps.
  5. Click Download Result Data to download and save a zip file of the information collected, on a host that has Internet access.
  6. Download this script and run the following command on that host to send the data to Cloudera Support:
    python phone_home.py --file [file-you-downloaded]
  Note:

If you want to send your file manually but choose not to download the script, you can follow the instructions documented on the Cloudera Customer Portal at Get Support - Uploading Files for Cloudera Support.

What Data Does Cloudera Manager Collect?

Cloudera Manager collects and returns a significant amount of information about the health and performance of the cluster. It includes the following:

  • Up to 1000 Cloudera Manager Audit Events: Configuration changes, add/remove of users, roles, services, etc.
  • Data about the cluster structure which includes a list of all hosts, roles, and services along with the configs that are set through Cloudera Manager. Where passwords are set in Cloudera Manager, the passwords are not returned.
  • Cloudera Manager License and version number.
  • One day's worth of Cloudera Manager events: This includes critical errors Cloudera Manager watches for and more.
  • Current health information for hosts, service, and roles. Includes results of health tests run by Cloudera Manager.
  • Heartbeat information from each host, service, and role. These include status and some information about memory/disk/processer usage.
  • The results of running Host Inspector.
  • One day's worth of Cloudera Manager metrics.
      Note: If you are using Cloudera Standard, Host metrics are not included.
  • A download of the debug pages for Cloudera Manager roles.
  • For each machine in the cluster, the result of running a number of system-level commands on that machine.
  • Logs from each role on the cluster, as well as the CM server and agent logs.