alerts
|
Alerts |
events |
CDH3, CDH4 |
can_commit_avg_time
|
Can Commit Average Time |
ms |
CDH3 |
can_commit_num_ops
|
Can Commit Operations |
operations |
CDH3 |
cgroup_mem_page_cache
|
Page cache usage of the role's cgroup |
bytes |
CDH3, CDH4 |
cgroup_mem_rss
|
Resident memory of the role's cgroup |
bytes |
CDH3, CDH4 |
cgroup_mem_swap
|
Swap usage of the role's cgroup |
bytes |
CDH3, CDH4 |
cgroup_read_bytes
|
Bytes read from all disks by the role's cgroup |
bytes |
CDH3, CDH4 |
cgroup_read_ios
|
Number of read I/O operations from all disks by the role's cgroup |
ios |
CDH3, CDH4 |
cgroup_total_cpu_system
|
CPU usage of the role's cgroup |
seconds |
CDH3, CDH4 |
cgroup_total_cpu_user
|
User Space CPU usage of the role's cgroup |
seconds |
CDH3, CDH4 |
cgroup_write_bytes
|
Bytes written to all disks by the role's cgroup |
bytes |
CDH3, CDH4 |
cgroup_write_ios
|
Number of write I/O operations to all disks by the role's cgroup |
ios |
CDH3, CDH4 |
commit_pending_avg_time
|
Commit Pending Average Time |
ms |
CDH3 |
commit_pending_num_ops
|
Commit Pending Operations |
operations |
CDH3 |
done_avg_time
|
Done Average Time |
ms |
CDH3 |
done_num_ops
|
Done Operations |
operations |
CDH3 |
events_critical
|
Critical Events |
events |
CDH3, CDH4 |
events_important
|
Important Events |
events |
CDH3, CDH4 |
events_informational
|
Informational Events |
events |
CDH3, CDH4 |
failed_dirs
|
Failed Directories |
directories |
CDH3, CDH4 |
fatal_error_avg_time
|
Fatal Error Average Time |
ms |
CDH3 |
fatal_error_num_ops
|
Fatal Error Operations |
operations |
CDH3 |
fd_max
|
Maximum number of file descriptors |
file descriptors |
CDH3, CDH4 |
fd_open
|
Open file descriptors |
file descriptors |
CDH3, CDH4 |
fs_error_avg_time
|
Filesystem Error Average Time |
ms |
CDH3 |
fs_error_num_ops
|
Filesystem Error Operations |
operations |
CDH3 |
get_map_completion_events_avg_time
|
Get Map Completion Events Average Time |
ms |
CDH3 |
get_map_completion_events_num_ops
|
Get Map Completion Events Operations |
operations |
CDH3 |
get_protocol_version_avg_time
|
Get Protocol Version Average Time |
ms |
CDH3 |
get_protocol_version_num_ops
|
Get Protocol Version Operations |
operations |
CDH3 |
get_task_avg_time
|
Get Task Average Time |
ms |
CDH3 |
get_task_num_ops
|
Get Task Operations |
operations |
CDH3 |
host_health
|
The health of the host on which the role runs |
kaiser health |
CDH3, CDH4 |
host_subject_status_1
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
host_subject_status_10
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
host_subject_status_2
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
host_subject_status_3
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
host_subject_status_4
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
host_subject_status_5
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
host_subject_status_6
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
host_subject_status_7
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
host_subject_status_8
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
host_subject_status_9
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
jvm_blocked_threads
|
Blocked threads |
threads |
CDH3, CDH4 |
jvm_gc_count
|
Number of garbage collections |
garbage collections |
CDH3, CDH4 |
jvm_gc_time_ms
|
Total time spent garbage collecting (ms) |
ms |
CDH3, CDH4 |
jvm_heap_committed_mb
|
Total amount of committed heap memory (MB) |
MB |
CDH3, CDH4 |
jvm_heap_used_mb
|
Total amount of used heap memory (MB) |
MB |
CDH3, CDH4 |
jvm_max_memory_mb
|
Maximum allowed memory (MB) |
MB |
CDH3, CDH4 |
jvm_new_threads
|
New threads |
threads |
CDH3, CDH4 |
jvm_non_heap_committed_mb
|
Total amount of committed non-heap memory (MB) |
MB |
CDH3, CDH4 |
jvm_non_heap_used_mb
|
Total amount of used non-heap memory (MB) |
MB |
CDH3, CDH4 |
jvm_runnable_threads
|
Runnable threads |
threads |
CDH3, CDH4 |
jvm_terminated_threads
|
Terminated threads |
threads |
CDH3, CDH4 |
jvm_timed_waiting_threads
|
Timed waiting threads |
threads |
CDH3, CDH4 |
jvm_total_threads
|
Total threads |
threads |
CDH3, CDH4 |
jvm_waiting_threads
|
Waiting threads |
threads |
CDH3, CDH4 |
kaiser_health
|
Health value computed by Cloudera Manager |
kaiser health |
CDH3, CDH4 |
log_error
|
Logged Errors |
messages |
CDH3, CDH4 |
log_fatal
|
Logged Fatals |
messages |
CDH3, CDH4 |
log_info
|
Logged Infos |
messages |
CDH3, CDH4 |
log_warn
|
Logged Warnings |
messages |
CDH3, CDH4 |
login_failure_avg_time
|
Average Failed Login Time |
ms |
CDH3 |
login_failure_num_ops
|
Login Failures |
operations |
CDH3 |
login_success_avg_time
|
Average Successful Login Time |
ms |
CDH3 |
login_success_num_ops
|
Login Successes |
operations |
CDH3 |
map_task_slots
|
Map Task Slots |
slots |
CDH3, CDH4 |
maps_running
|
Running map tasks |
tasks |
CDH3, CDH4 |
ping_avg_time
|
Ping Average Time |
ms |
CDH3 |
ping_num_ops
|
Ping Operations |
operations |
CDH3 |
reduce_task_slots
|
Reduce Task Slots |
slots |
CDH3, CDH4 |
reduces_running
|
Reduces Running |
tasks |
CDH3, CDH4 |
report_diagnostic_info_avg_time
|
Report Diagnostic Info Average Time |
ms |
CDH3 |
report_diagnostic_info_num_ops
|
Report Diagnostic Info Operations |
operations |
CDH3 |
report_next_record_range_avg_time
|
Report Next Record Range Average Time |
ms |
CDH3 |
report_next_record_range_num_ops
|
Report Next Record Range Operations |
operations |
CDH3 |
role_start_time
|
Role start time |
timestamp |
CDH3, CDH4 |
rpc_authentication_failures
|
RPC Authentication Failures |
failures |
CDH3 |
rpc_authentication_successes
|
RPC Authentication Successes |
successes |
CDH3 |
rpc_authorization_failures
|
RPC Authorization Failures |
failures |
CDH3 |
rpc_authorization_successes
|
RPC Authorization Successes |
successes |
CDH3 |
rpc_call_queue_length
|
RPC Call Queue Length |
items |
CDH3 |
rpc_num_open_connections
|
Open RPC Connections |
connections |
CDH3 |
rpc_processing_time_avg_time
|
Average RPC Processing Time |
ms |
CDH3 |
rpc_processing_time_num_ops
|
RPCs Processed |
operations |
CDH3 |
rpc_queue_time_avg_time
|
Average RPC Queue Time |
ms |
CDH3 |
rpc_queue_time_num_ops
|
RPCs Queued |
operations |
CDH3 |
rpc_received_bytes
|
RPC Received Bytes |
bytes |
CDH3 |
rpc_sent_bytes
|
RPC Sent Bytes |
bytes |
CDH3 |
scm_health
|
Health value computed by Cloudera Manager |
SCM health |
CDH3, CDH4 |
scm_health_reason
|
Reason for health value computed by Cloudera Manager |
SCM health reason |
CDH3, CDH4 |
scm_process_state
|
Process State according to Cloudera Manager |
SCM process state |
CDH3, CDH4 |
scm_role_state
|
Role state according to Cloudera Manager |
SCM role state |
CDH3, CDH4 |
shuffle_error_avg_time
|
Shuffle Error Average Time |
ms |
CDH3 |
shuffle_error_num_ops
|
Shuffle Error Operations |
operations |
CDH3 |
shuffle_exceptions_caught
|
Shuffle Handler Exceptions Caught |
exceptions |
CDH3, CDH4 |
shuffle_failed_outputs
|
Shuffle Handler Failed Requests |
requests |
CDH3, CDH4 |
shuffle_handler_busy_percent
|
Shuffle Handler Busy Percentage |
percent |
CDH3, CDH4 |
shuffle_output_bytes
|
Shuffle Output |
bytes |
CDH3, CDH4 |
shuffle_success_outputs
|
Shuffle Handler Successful Requests |
requests |
CDH3, CDH4 |
slave_master_connectivity
|
Indicates whether the master node detects the slave as connected |
master connectivity |
CDH3, CDH4 |
status_update_avg_time
|
Status Update Average Time |
ms |
CDH3 |
status_update_num_ops
|
Status Update Operations |
operations |
CDH3 |
subject_status_1
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
subject_status_10
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
subject_status_2
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
subject_status_3
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
subject_status_4
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
subject_status_5
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
subject_status_6
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
subject_status_7
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
subject_status_8
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
subject_status_9
|
Field containing subject specific status information |
subject status |
CDH3, CDH4 |
tasks_completed
|
Tasks Completed |
tasks |
CDH3, CDH4 |
tasks_failed_ping
|
Tasks Failed: Ping |
tasks |
CDH3, CDH4 |
tasks_failed_timeout
|
Tasks Failed: Timeout |
tasks |
CDH3, CDH4 |
total_cpu_system
|
Total System CPU |
seconds |
CDH3, CDH4 |
total_cpu_user
|
Total CPU user time |
seconds |
CDH3, CDH4 |
tt_blacklisted
|
TaskTracker Blacklisted Status |
blacklisted status |
CDH3, CDH4 |
unexpected_exits
|
Unexpected rocess exits |
unexpected exits |
CDH3, CDH4 |
update_private_distributed_cache_sizes_avg_time
|
Update Private Distributed Cache Sizes Average Time |
ms |
CDH3 |
update_private_distributed_cache_sizes_num_ops
|
Update Private Distributed Cache Sizes Operations |
operations |
CDH3 |
web_metrics_collection_duration
|
Web Server Responsiveness |
ms |
CDH3, CDH4 |
web_metrics_collection_status
|
Web Metric Collection Status |
metric collection status |
CDH3, CDH4 |