Monitoring in Mesos
In this section, we will take a look at the different metrics that Mesos provides to monitor the various components.
Monitoring provided by Mesos
Mesos master and slave nodes provide rich data that enables resource utilization monitoring and anomaly detection. The information includes details about available resources, used resources, registered frameworks, active slaves, and task state. This data can be used to create automated alerts and to build a cluster health monitoring dashboard. More details can be found at http://mesos.apache.org/documentation/latest/monitoring/.
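As a rough illustration of how this data could drive an automated alert, the following sketch evaluates two simple rules against a dictionary of metrics. It assumes the dictionary was obtained from the master's /metrics/snapshot endpoint and uses the metric names master/slaves_active, master/cpus_used, and master/cpus_total. The function name check_cluster_health and both thresholds are hypothetical and chosen only for illustration.

```python
def check_cluster_health(snapshot, min_active_slaves=1, max_cpu_fraction=0.9):
    """Return a list of alert messages for a metrics snapshot (a plain dict).

    The thresholds are illustrative assumptions, not recommended values.
    """
    alerts = []

    # Rule 1: too few active slaves registered with the master.
    active = snapshot.get("master/slaves_active", 0)
    if active < min_active_slaves:
        alerts.append(
            f"active slaves: {int(active)} (expected at least {min_active_slaves})"
        )

    # Rule 2: share of CPUs in use exceeds the threshold.
    total = snapshot.get("master/cpus_total", 0)
    used = snapshot.get("master/cpus_used", 0)
    if total > 0 and used / total > max_cpu_fraction:
        alerts.append(
            f"CPU usage: {used / total:.0%} (threshold {max_cpu_fraction:.0%})"
        )

    return alerts
```

A monitoring job could call this function periodically with a fresh snapshot and forward any returned messages to a notification system.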
Network statistics for each active container are published through the /monitor/statistics.json endpoint on the slave.
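The following sketch shows one way to read this endpoint and pick out the network-related fields. It assumes that the slave listens on port 5051, that the response is a JSON array with one entry per executor, and that network counters appear under a statistics object with names starting with net_ (for example, net_rx_bytes). The helper names fetch_container_statistics and network_usage are hypothetical.

```python
import json
from urllib.request import urlopen


def fetch_container_statistics(slave_host, port=5051, timeout=5):
    """Fetch the raw statistics list from a slave (port 5051 is an assumption)."""
    url = f"http://{slave_host}:{port}/monitor/statistics.json"
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


def network_usage(entries):
    """Map each executor ID to its network-related statistics.

    Assumes each entry has an "executor_id" and a "statistics" object,
    and that network fields are prefixed with "net_".
    """
    usage = {}
    for entry in entries:
        stats = entry.get("statistics", {})
        usage[entry.get("executor_id", "unknown")] = {
            key: value for key, value in stats.items() if key.startswith("net_")
        }
    return usage
```

For example, network_usage(fetch_container_statistics("slave1.example.com")) would return one dictionary of net_ fields per executor, where the host name is a placeholder.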
Mesos provides two different kinds of metrics: counters and gauges. These can be explained as follows:
- Counters: These measure discrete events, such as the number of finished tasks or invalid status updates. Their values are always whole numbers and only ever increase.
- Gauges: These give a snapshot of a particular metric at a point in time, such as the number of active frameworks or running tasks.
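The difference between the two kinds matters when the values are consumed. A minimal sketch, assuming two metric snapshots taken some time apart and represented as plain dictionaries: a counter such as master/tasks_finished is usually interpreted through the difference between snapshots, whereas a gauge such as master/tasks_running is read directly from the latest one. The function names and sample values below are hypothetical.

```python
def counter_delta(previous, current, name):
    """Number of events a counter recorded between two snapshots."""
    return current[name] - previous[name]


def gauge_value(current, name):
    """A gauge is meaningful on its own, so only the latest snapshot is needed."""
    return current[name]


# Two illustrative snapshots taken a few minutes apart (made-up values).
earlier = {"master/tasks_finished": 120, "master/tasks_running": 8}
later = {"master/tasks_finished": 135, "master/tasks_running": 5}

finished_since = counter_delta(earlier, later, "master/tasks_finished")
running_now = gauge_value(later, "master/tasks_running")
```

Here finished_since is the number of tasks that finished between the two snapshots, while running_now reflects only the state at the time of the later snapshot.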