Flume Health Checks
Flume Agents Health
This is a Flume service-level health check that checks that enough of the Flume agents in the cluster are healthy. The check returns "Concerning" health if the number of healthy Flume agents falls below a warning threshold, expressed as a percentage of the total number of Flume agents. The check returns "Bad" health if the number of healthy and "Concerning" Flume agents falls below a critical threshold, expressed as a percentage of the total number of Flume agents. For example, if this check is configured with a warning threshold of 80% and a critical threshold of 60% for a cluster of five Flume agents, this check would return "Good" health if four or more Flume agents have good health. This check would return "Concerning" health if at least three Flume agents have either "Good" or "Concerning" health. If more than two Flume agents have bad health, this check would return "Bad" health. A failure of this health check indicates unhealthy Flume agents. Check the status of the individual Flume agents for more information. This test can be configured using the Healthy Flume Agent Monitoring Thresholds Flume service-wide monitoring setting.
Short Name: Flume Agents Health
Property Name | Description | Template Name | Default Value | Unit |
---|---|---|---|---|
Healthy Flume Agent Monitoring Thresholds | The health check thresholds of the overall Flume Agents health. The check returns "Concerning" health if the percentage of "Healthy" Flume Agents falls below the warning threshold. The check is unhealthy if the total percentage of "Healthy" and "Concerning" Flume Agents falls below the critical threshold. | flume_agents_healthy_thresholds | critical:never, warning:95.000000 | PERCENT |
<< | ||
Terms and Conditions Privacy Policy |