Define monitoring.
The continuous observation of a system to detect changes and ensure performance.
What is traceability?
The ability to track the history, application, or location of an item or activity.
Fill in the blank: Logging is the process of _______ events in a system.
recording
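Example (a minimal sketch, not part of the original card): recording an event with Python's standard `logging` module. The logger name `checkout`, the helper `record_event`, and the in-memory stream standing in for a log file are all assumptions made for illustration.

```python
import io
import logging

# An in-memory stream stands in for a log file (assumption for this sketch).
stream = io.StringIO()
handler = logging.StreamHandler(stream)
handler.setFormatter(logging.Formatter("%(levelname)s %(name)s %(message)s"))

# "checkout" is a hypothetical logger name.
logger = logging.getLogger("checkout")
logger.setLevel(logging.INFO)
logger.addHandler(handler)
logger.propagate = False  # keep output confined to our stream


def record_event(message: str) -> str:
    """Record one event at INFO level and return everything logged so far."""
    logger.info(message)
    return stream.getvalue()
```

Each call appends one formatted line to the stream, which is the "recording" step the card refers to.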
True or false: Metrics are quantitative measures used to assess performance.
TRUE
What does APM stand for?
Application Performance Management (also expanded as Application Performance Monitoring); tools to monitor and manage application performance.
Define log management.
The process of collecting, storing, and analyzing log data for insights.
What is the purpose of alerting in monitoring?
To notify stakeholders of issues or anomalies in system performance.
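Example (a minimal sketch, assuming simple static thresholds): an alert fires when a metric sample exceeds its configured limit. The function name `evaluate_alerts`, the metric names, and the message format are hypothetical; real alerting systems typically add durations, severities, and notification routing.

```python
from typing import Dict, List


def evaluate_alerts(samples: Dict[str, float], thresholds: Dict[str, float]) -> List[str]:
    """Return one alert message per metric whose sample exceeds its threshold."""
    alerts = []
    for metric, limit in thresholds.items():
        value = samples.get(metric)
        # Metrics with no sample are skipped rather than treated as breaches.
        if value is not None and value > limit:
            alerts.append(f"ALERT {metric}={value} exceeds {limit}")
    return alerts
```

In practice the returned messages would be handed to a notification channel such as email or a paging service.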
Fill in the blank: Metrics can be categorized as _______ or business metrics.
technical
True or false: Distributed tracing helps in monitoring microservices.
TRUE
What is a log file?
A file that records events, transactions, or messages generated by a system.
Define observability.
The ability to infer the internal state of a system from its external outputs, such as logs, metrics, and traces.
What is the role of metrics collection tools?
To gather and store performance data for analysis and reporting.
Fill in the blank: Syslog is a standard for _______ log messages.
sending
True or false: Centralized logging consolidates logs from multiple sources.
TRUE
What is Grafana used for?
Visualizing and querying metrics and logs through dashboards.
Define event correlation.
The process of linking related events to identify patterns or issues.
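Example (a minimal sketch, assuming every event carries a shared identifier): one common way to link related events is to group them by a correlation key. The field names `request_id` and `message` and the function `correlate` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def correlate(events: Iterable[Dict[str, str]]) -> Dict[str, List[str]]:
    """Group event messages by their request_id, preserving arrival order."""
    groups = defaultdict(list)
    for event in events:
        groups[event["request_id"]].append(event["message"])
    return dict(groups)
```

Events that share a key end up side by side, which makes it easier to spot a pattern such as a timeout followed by a retry within one request.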
What is Prometheus?
An open-source monitoring and alerting toolkit that collects and stores metrics as time series and is designed for reliability.
Fill in the blank: ELK Stack consists of Elasticsearch, Logstash, and _______.
Kibana
True or false: Metrics are always qualitative.
FALSE
What is the purpose of dashboards in monitoring?
To provide a visual representation of system performance and health.
Define service level agreement (SLA).
A contract between a service provider and its customer that defines expected service performance and availability levels.
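Example (a minimal sketch of the arithmetic behind an availability target): converting an availability percentage into the downtime it permits over a period. The function name `allowed_downtime_minutes` and the 30-day default period are assumptions; actual SLAs define their own measurement windows and exclusions.

```python
def allowed_downtime_minutes(availability_percent: float, period_days: int = 30) -> float:
    """Return the downtime, in minutes, permitted by an availability target."""
    total_minutes = period_days * 24 * 60
    # The unavailable fraction of the period is the permitted downtime.
    return total_minutes * (1 - availability_percent / 100)
```

For instance, a 99.9% target over 30 days permits roughly 43.2 minutes of downtime.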
What does root cause analysis aim to identify?
The underlying reason for a problem or incident in a system.
Fill in the blank: Telemetry refers to the automated collection and transmission of _______ from remote systems.
data