Monitor

work metrics

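  • throughput: requests per second (rps)
  • success: percentage of requests that completed successfully
  • error: number of failed requests, usually expressed as a rate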
  • performance: response time
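
As a rough illustration, the four work metrics can be computed from a list of request records collected over a fixed window. This is a minimal sketch: the `Request` record, its field names, and the `work_metrics` helper are hypothetical and not tied to any particular monitoring tool, and the error figure here is normalized by throughput.

```python
from dataclasses import dataclass


@dataclass
class Request:
    """Hypothetical record of one handled request."""
    duration_ms: float  # response time in milliseconds
    ok: bool            # True if the request succeeded


def work_metrics(requests, window_seconds):
    """Summarize work metrics for requests observed in one time window."""
    total = len(requests)
    successes = sum(1 for r in requests if r.ok)
    errors = total - successes
    return {
        # throughput: requests handled per second
        "throughput_rps": total / window_seconds,
        # success and error, expressed as fractions of all requests
        "success_rate": successes / total if total else 0.0,
        "error_rate": errors / total if total else 0.0,
        # performance: mean response time
        "avg_response_ms": (
            sum(r.duration_ms for r in requests) / total if total else 0.0
        ),
    }


sample = [
    Request(100.0, True),
    Request(200.0, True),
    Request(300.0, False),
    Request(400.0, True),
]
metrics = work_metrics(sample, window_seconds=2.0)
```

In practice, performance is often reported as a percentile (for example the 99th) rather than a mean, since averages hide slow outliers.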

resource metrics

  • utilization: percentage of time that the resource is busy
  • saturation: measure of the amount of requested work that the resource cannot yet service, often queued
  • errors: internal errors that may not be observable in the work the resource produces
  • availability: represents the percentage of time that the resource responded to requests. This metric is only well-defined for resources that can be actively and regularly checked for availability.
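
The resource metrics above can be sketched in the same way. This example assumes a hypothetical sampler that reports how long the resource was busy, periodic queue-depth samples, and the results of regular availability checks; the function name and parameters are illustrative only.

```python
def resource_metrics(busy_seconds, window_seconds, queue_depths, checks):
    """Summarize resource metrics for one observation window.

    busy_seconds   -- time the resource spent doing work (assumed input)
    window_seconds -- length of the observation window
    queue_depths   -- sampled counts of requests waiting to be serviced
    checks         -- booleans, True when an availability check got a response
    """
    # utilization: percentage of time the resource was busy
    utilization = 100.0 * busy_seconds / window_seconds
    # saturation: average amount of queued, not-yet-serviced work
    saturation = sum(queue_depths) / len(queue_depths) if queue_depths else 0.0
    # availability: percentage of checks that got a response;
    # undefined (None) if the resource was never checked
    availability = 100.0 * sum(checks) / len(checks) if checks else None
    return {
        "utilization_pct": utilization,
        "saturation_avg_queue": saturation,
        "availability_pct": availability,
    }


summary = resource_metrics(
    busy_seconds=45.0,
    window_seconds=60.0,
    queue_depths=[0, 2, 4],
    checks=[True, True, True, False],
)
```

Returning `None` for availability when there are no checks mirrors the note above that the metric is only well-defined for resources that can be checked regularly.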

event metrics

  • Changes: Internal code releases, builds, and build failures
  • Alerts: Internally generated alerts or third-party notifications
  • Scaling events: Adding or subtracting hosts
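
Unlike the metrics above, events are discrete occurrences rather than continuous measurements. One possible way to represent them is as timestamped records that can be looked up by time window, for example when investigating a spike in a work metric. The `Event` type, `EventKind` values, and `events_in_window` helper below are hypothetical names chosen for this sketch.

```python
from dataclasses import dataclass
from enum import Enum


class EventKind(Enum):
    CHANGE = "change"    # code releases, builds, build failures
    ALERT = "alert"      # internal alerts or third-party notifications
    SCALING = "scaling"  # hosts added or removed


@dataclass(frozen=True)
class Event:
    kind: EventKind
    timestamp: float  # seconds since the epoch (assumed convention)
    summary: str


def events_in_window(events, start, end):
    """Return events with start <= timestamp <= end, oldest first."""
    selected = [e for e in events if start <= e.timestamp <= end]
    return sorted(selected, key=lambda e: e.timestamp)


log = [
    Event(EventKind.SCALING, 300.0, "added 2 hosts"),
    Event(EventKind.CHANGE, 100.0, "release deployed"),
    Event(EventKind.ALERT, 900.0, "disk usage alert"),
]
recent = events_in_window(log, start=0.0, end=600.0)
```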