TaskTracker Metrics

Metric Name Description Unit CDH Version
alerts Alerts events CDH3, CDH4
can_commit_avg_time Can Commit Average Time ms CDH3
can_commit_num_ops Can Commit Operations operations CDH3
cgroup_mem_page_cache Page cache usage of the role's cgroup bytes CDH3, CDH4
cgroup_mem_rss Resident memory of the role's cgroup bytes CDH3, CDH4
cgroup_mem_swap Swap usage of the role's cgroup bytes CDH3, CDH4
cgroup_read_bytes Bytes read from all disks by the role's cgroup bytes CDH3, CDH4
cgroup_read_ios Number of read I/O operations from all disks by the role's cgroup ios CDH3, CDH4
cgroup_total_cpu_system CPU usage of the role's cgroup seconds CDH3, CDH4
cgroup_total_cpu_user User Space CPU usage of the role's cgroup seconds CDH3, CDH4
cgroup_write_bytes Bytes written to all disks by the role's cgroup bytes CDH3, CDH4
cgroup_write_ios Number of write I/O operations to all disks by the role's cgroup ios CDH3, CDH4
commit_pending_avg_time Commit Pending Average Time ms CDH3
commit_pending_num_ops Commit Pending Operations operations CDH3
done_avg_time Done Average Time ms CDH3
done_num_ops Done Operations operations CDH3
events_critical Critical Events events CDH3, CDH4
events_important Important Events events CDH3, CDH4
events_informational Informational Events events CDH3, CDH4
failed_dirs Failed Directories directories CDH3, CDH4
fatal_error_avg_time Fatal Error Average Time ms CDH3
fatal_error_num_ops Fatal Error Operations operations CDH3
fd_max Maximum number of file descriptors file descriptors CDH3, CDH4
fd_open Open file descriptors file descriptors CDH3, CDH4
fs_error_avg_time Filesystem Error Average Time ms CDH3
fs_error_num_ops Filesystem Error Operations operations CDH3
get_map_completion_events_avg_time Get Map Completion Events Average Time ms CDH3
get_map_completion_events_num_ops Get Map Completion Events Operations operations CDH3
get_protocol_version_avg_time Get Protocol Version Average Time ms CDH3
get_protocol_version_num_ops Get Protocol Version Operations operations CDH3
get_task_avg_time Get Task Average Time ms CDH3
get_task_num_ops Get Task Operations operations CDH3
host_health The health of the host on which the role runs kaiser health CDH3, CDH4
host_subject_status_1 Field containing subject specific status information subject status CDH3, CDH4
host_subject_status_10 Field containing subject specific status information subject status CDH3, CDH4
host_subject_status_2 Field containing subject specific status information subject status CDH3, CDH4
host_subject_status_3 Field containing subject specific status information subject status CDH3, CDH4
host_subject_status_4 Field containing subject specific status information subject status CDH3, CDH4
host_subject_status_5 Field containing subject specific status information subject status CDH3, CDH4
host_subject_status_6 Field containing subject specific status information subject status CDH3, CDH4
host_subject_status_7 Field containing subject specific status information subject status CDH3, CDH4
host_subject_status_8 Field containing subject specific status information subject status CDH3, CDH4
host_subject_status_9 Field containing subject specific status information subject status CDH3, CDH4
jvm_blocked_threads Blocked threads threads CDH3, CDH4
jvm_gc_count Number of garbage collections garbage collections CDH3, CDH4
jvm_gc_time_ms Total time spent garbage collecting (ms) ms CDH3, CDH4
jvm_heap_committed_mb Total amount of committed heap memory (MB) MB CDH3, CDH4
jvm_heap_used_mb Total amount of used heap memory (MB) MB CDH3, CDH4
jvm_max_memory_mb Maximum allowed memory (MB) MB CDH3, CDH4
jvm_new_threads New threads threads CDH3, CDH4
jvm_non_heap_committed_mb Total amount of committed non-heap memory (MB) MB CDH3, CDH4
jvm_non_heap_used_mb Total amount of used non-heap memory (MB) MB CDH3, CDH4
jvm_runnable_threads Runnable threads threads CDH3, CDH4
jvm_terminated_threads Terminated threads threads CDH3, CDH4
jvm_timed_waiting_threads Timed waiting threads threads CDH3, CDH4
jvm_total_threads Total threads threads CDH3, CDH4
jvm_waiting_threads Waiting threads threads CDH3, CDH4
kaiser_health Health value computed by Cloudera Manager kaiser health CDH3, CDH4
log_error Logged Errors messages CDH3, CDH4
log_fatal Logged Fatals messages CDH3, CDH4
log_info Logged Infos messages CDH3, CDH4
log_warn Logged Warnings messages CDH3, CDH4
login_failure_avg_time Average Failed Login Time ms CDH3
login_failure_num_ops Login Failures operations CDH3
login_success_avg_time Average Successful Login Time ms CDH3
login_success_num_ops Login Successes operations CDH3
map_task_slots Map Task Slots slots CDH3, CDH4
maps_running Running map tasks tasks CDH3, CDH4
ping_avg_time Ping Average Time ms CDH3
ping_num_ops Ping Operations operations CDH3
reduce_task_slots Reduce Task Slots slots CDH3, CDH4
reduces_running Reduces Running tasks CDH3, CDH4
report_diagnostic_info_avg_time Report Diagnostic Info Average Time ms CDH3
report_diagnostic_info_num_ops Report Diagnostic Info Operations operations CDH3
report_next_record_range_avg_time Report Next Record Range Average Time ms CDH3
report_next_record_range_num_ops Report Next Record Range Operations operations CDH3
role_start_time Role start time timestamp CDH3, CDH4
rpc_authentication_failures RPC Authentication Failures failures CDH3
rpc_authentication_successes RPC Authentication Successes successes CDH3
rpc_authorization_failures RPC Authorization Failures failures CDH3
rpc_authorization_successes RPC Authorization Successes successes CDH3
rpc_call_queue_length RPC Call Queue Length items CDH3
rpc_num_open_connections Open RPC Connections connections CDH3
rpc_processing_time_avg_time Average RPC Processing Time ms CDH3
rpc_processing_time_num_ops RPCs Processed operations CDH3
rpc_queue_time_avg_time Average RPC Queue Time ms CDH3
rpc_queue_time_num_ops RPCs Queued operations CDH3
rpc_received_bytes RPC Received Bytes bytes CDH3
rpc_sent_bytes RPC Sent Bytes bytes CDH3
scm_health Health value computed by Cloudera Manager SCM health CDH3, CDH4
scm_health_reason Reason for health value computed by Cloudera Manager SCM health reason CDH3, CDH4
scm_process_state Process State according to Cloudera Manager SCM process state CDH3, CDH4
scm_role_state Role state according to Cloudera Manager SCM role state CDH3, CDH4
shuffle_error_avg_time Shuffle Error Average Time ms CDH3
shuffle_error_num_ops Shuffle Error Operations operations CDH3
shuffle_exceptions_caught Shuffle Handler Exceptions Caught exceptions CDH3, CDH4
shuffle_failed_outputs Shuffle Handler Failed Requests requests CDH3, CDH4
shuffle_handler_busy_percent Shuffle Handler Busy Percentage percent CDH3, CDH4
shuffle_output_bytes Shuffle Output bytes CDH3, CDH4
shuffle_success_outputs Shuffle Handler Successful Requests requests CDH3, CDH4
slave_master_connectivity Indicates whether the master node detects the slave as connected master connectivity CDH3, CDH4
status_update_avg_time Status Update Average Time ms CDH3
status_update_num_ops Status Update Operations operations CDH3
subject_status_1 Field containing subject specific status information subject status CDH3, CDH4
subject_status_10 Field containing subject specific status information subject status CDH3, CDH4
subject_status_2 Field containing subject specific status information subject status CDH3, CDH4
subject_status_3 Field containing subject specific status information subject status CDH3, CDH4
subject_status_4 Field containing subject specific status information subject status CDH3, CDH4
subject_status_5 Field containing subject specific status information subject status CDH3, CDH4
subject_status_6 Field containing subject specific status information subject status CDH3, CDH4
subject_status_7 Field containing subject specific status information subject status CDH3, CDH4
subject_status_8 Field containing subject specific status information subject status CDH3, CDH4
subject_status_9 Field containing subject specific status information subject status CDH3, CDH4
tasks_completed Tasks Completed tasks CDH3, CDH4
tasks_failed_ping Tasks Failed: Ping tasks CDH3, CDH4
tasks_failed_timeout Tasks Failed: Timeout tasks CDH3, CDH4
total_cpu_system Total System CPU seconds CDH3, CDH4
total_cpu_user Total CPU user time seconds CDH3, CDH4
tt_blacklisted TaskTracker Blacklisted Status blacklisted status CDH3, CDH4
unexpected_exits Unexpected rocess exits unexpected exits CDH3, CDH4
update_private_distributed_cache_sizes_avg_time Update Private Distributed Cache Sizes Average Time ms CDH3
update_private_distributed_cache_sizes_num_ops Update Private Distributed Cache Sizes Operations operations CDH3
web_metrics_collection_duration Web Server Responsiveness ms CDH3, CDH4
web_metrics_collection_status Web Metric Collection Status metric collection status CDH3, CDH4