What is availability?
a property of software that it is there and ready to carry out its task when you need it to be
How is availability different from reliability?
it builds on reliability by adding the notion of recovery and repair
Availability general scenario: what are 5 possible values for “Source”
internal/external:
Availability general scenario: what are 4 possible values for “Stimulus”
Availability general scenario: what are 4 possible values for “Artifact”
Availability general scenario: what are 6 possible values for “Environment”
Availability general scenario: what are 3 possible values for “Response”
Availability general scenario: what are 6 possible values for “Response Measure”
2 system actions that are done in order to “detect the fault”
- notify appropriate entities
4 possible system actions that can be done in order to “recover from fault”
Definition of availability tactics?
they enable a system to endure faults so that services remain compliant with their specifications
The main goal of availability tactics?
to keep faults from becoming failures or at least bound the effects of the fault and make repair possible
9 tactics for detecting faults
Tactic for detecting faults: What is ping/echo?
an asynchronous request/response message pair exchanged between nodes, used to determine reachability and the round-trip delay through the associated network path
Tactic for detecting faults: What is a monitor?
a component used to monitor the state of health of other parts of the system
Tactic for detecting faults: What is a heartbeat?
a periodic message exchange between a system monitor and a process being monitored
Tactic for detecting faults: What is a timestamp?
used to detect incorrect sequences of events, primarily in distributed message-passing systems
Tactic for detecting faults: What is sanity checking?
checks the validity or reasonableness of a component’s operations or outputs
Tactic for detecting faults: What is condition monitoring?
checking conditions in a process or device, or validating assumptions made during the design
Tactic for detecting faults: What is voting?
to check that replicated components are producing the same results
Tactic for detecting faults: What is exception detection?
detection of a system condition that alters the normal flow of execution
Tactic for detecting faults: What is self-test?
a procedure for a component to test itself for correct operation
What are the 3 main categories of availability tactics?
What are the 2 sub-categories of availability tactics under recover from faults?
- reintroduction