what is performance?
the ability of a system to meet timing requirements
for the performance general scenario, what is the source
internal or external to the system
for the performance general scenario, what is the stimulus
arrival of a periodic, sporadic, or stochastic event
for the performance general scenario, what is the artifact
the system or one or more of its components
for the performance general scenario, what are the 4 possible environments
normal mode
emergency mode
peak load
overload mode
for the performance general scenario, what are the 2 responses
process events
change level of service
for the performance general scenario, what are the 5 possible response measures
latency deadline throughput jitter miss rate
what is the primary goal of performance tactics
to generate a response to an event arriving at the system within some time-based constraint
2 categories of performance tactics
control resource demand
manage resources
6 tactics for controlling resource demand
what does it mean to manage sampling rate
to reduce the sampling frequency at which a stream of data is captured if possible typically with some loss of fidelity
what does it mean to limit event response
process events only up to a set maximum rate to ensure more predictable processing
what does it mean to prioritize events
you can rank events in order of how important it is to service them
what does it mean to bound execution times
place a limit on how much execution time is used to respond to an event
what does it mean to increase resource efficiency
improving algorithms used in critical areas to decrease latency
6 tactics for managing resources
increase resources
introduce concurrency
maintain multiple copies of computations
maintain multiple copies of the data
bound queue sizes
schedule resources
what does it mean to maintain multiple copies of computations
have replicas on different servers to reduce resource contention that would occur if all computations took place on a single server
what does it mean to maintain multiple copies of data
to keep copies of data on storage media with different access speeds
what does it mean to bound queue sizes
control the maximum number of queued arrivals and consequently, the resources used to process the arrivals
what does it mean to schedule resources
when there is resource contention, the resource must be scheduled
2 issues with performance in regards to allocation of responsibilities
5 things to consider for system responsibilities that involve heavy loading or have time-critical response requirements
4 things to consider when evaluating a coordination model with respect to performance
4 things to consider for data abstractions of the data model that will be heavily loaded or have time-critical response requirements