Week 10 Flashcards

Question

compilers and compiling for ILP (instruction level parallelism)

Answer 1

- Compiler can find ILP at a higher level of abstraction than hardware
- Compiler has access to source code, can analyze it, use techniques to reduce stalls

Answer 2

- technique to get more performance from loops that access arrays, in which multiple copies of the loop body are made and instructions from different iterations are scheduled together
- can reduce dependence-caused stalls if iterations are independent of one another
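As a rough illustration of the loop unrolling described above, here is a minimal sketch that unrolls an array loop by a factor of 4. The function names `add_scalar` and `add_scalar_unrolled` and the unroll factor are assumptions made for this example. In Python this only shows the shape of the transformation; the performance benefit comes when a compiler does this on machine code and schedules the four independent copies together.

```python
def add_scalar(values, s):
    """Original loop: one array element per iteration."""
    out = [0] * len(values)
    for i in range(len(values)):
        out[i] = values[i] + s
    return out


def add_scalar_unrolled(values, s):
    """Same loop unrolled by 4 (hypothetical factor chosen for illustration)."""
    n = len(values)
    out = [0] * n
    i = 0
    # Main unrolled body: four independent copies of the original loop body.
    while i + 4 <= n:
        out[i] = values[i] + s
        out[i + 1] = values[i + 1] + s
        out[i + 2] = values[i + 2] + s
        out[i + 3] = values[i + 3] + s
        i += 4
    # Cleanup loop for the leftover elements when n is not a multiple of 4.
    while i < n:
        out[i] = values[i] + s
        i += 1
    return out
```

The four statements in the unrolled body do not depend on each other, which is what gives a scheduler room to interleave them and hide stalls.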

Answer 3

time between the input and the output of an instruction

Answer 4

- technique that reorders machine instructions from the obvious compiled order to reduce dependence-caused stalls

Answer 5

- a practical reference tool used by programmers and system designers to understand and compare the typical delays associated with various operations
- can help you figure out how to reorder instructions to reduce stalls
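To make the link between a latency table and instruction reordering concrete, here is a small sketch that counts stall cycles for a given instruction order on a simplified in-order, single-issue pipeline. The `LATENCY` values, the `(op, dest, sources)` tuple format, and the function name `count_stalls` are all assumptions for illustration, not figures from a real processor.

```python
# Hypothetical latency table: extra cycles a dependent instruction must wait
# after its producer issues. The values are illustrative only.
LATENCY = {"load": 1, "alu": 0}


def count_stalls(program):
    """Count stall cycles for a list of (op, dest, sources) in issue order."""
    ready = {}   # register -> first cycle in which a consumer may issue
    cycle = -1   # issue cycle of the previous instruction
    for op, dest, sources in program:
        earliest = cycle + 1  # in-order: at most one instruction per cycle
        for reg in sources:
            earliest = max(earliest, ready.get(reg, 0))
        cycle = earliest
        ready[dest] = cycle + 1 + LATENCY[op]
    # With no stalls, the last instruction issues in cycle len(program) - 1.
    return cycle - (len(program) - 1)


# Obvious compiled order: each add immediately follows the load it depends on.
naive = [
    ("load", "x1", ["x0"]),
    ("alu", "x2", ["x1", "x1"]),
    ("load", "x3", ["x0"]),
    ("alu", "x4", ["x3", "x3"]),
]

# Reordered: both loads first, so each add finds its operand ready.
scheduled = [naive[0], naive[2], naive[1], naive[3]]

print(count_stalls(naive), count_stalls(scheduled))
```

Under these assumed latencies the naive order stalls twice and the reordered one not at all, while both compute the same values.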

Answer 6

- an unscheduled event that disrupts program execution
- used to detect overflow

Answer 7

- An exception that comes from outside of the processor
- Some architectures use the term interrupt for all exceptions

Answer 8

An interrupt for which the address to which control is transferred is determined by the cause of the exception

Answer 9

To discard instructions in a pipeline, usually due to an unexpected event

Answer 10

- move the branch address calculation up from the EX stage to the ID stage (calculate all possibilities at once)
- move up the branch decision (much more difficult)

Answer 11

Interrupts/exceptions in pipelined computers that are not associated with the exact instruction that was the cause of the interrupt or exception

Answer 12

An interrupt or exception that is always associated with the correct instruction in pipelined computers

Answer 13

- save the address of the problem instruction in the exception link register (ELR)
- transfer control to the OS at some specified address
- OS handles the exception: it either stops the program or handles it and continues
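The steps above can be sketched as a tiny state update. This is only a model under stated assumptions: the dictionary-based processor state, the `HANDLER_ADDRESS` value, the cause code, and the function name `take_exception` are hypothetical. It also records the cause in a second register (the ESR, on a later card) so the OS can tell what happened.

```python
HANDLER_ADDRESS = 0x1000  # hypothetical OS entry point, chosen for illustration
CAUSE_OVERFLOW = 1        # hypothetical cause code


def take_exception(state, cause):
    """Model the hardware side of exception entry on a dict of registers."""
    state["ELR"] = state["PC"]     # save the address of the problem instruction
    state["ESR"] = cause           # record why the exception happened
    state["PC"] = HANDLER_ADDRESS  # transfer control to the OS
    return state


state = take_exception({"PC": 0x40, "ELR": 0, "ESR": 0}, CAUSE_OVERFLOW)
print(hex(state["PC"]), hex(state["ELR"]), state["ESR"])
```

After the handler runs, the OS can either terminate the program or use the saved ELR value to resume it.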

Answer 14

- A 64-bit register used to hold the address of the affected instruction when an exception happens
- needed even when interrupts are vectored

Answer 15

- A register used to record the cause of the exception
- In LEGv8, this register is 32 bits, although some bits are currently unused

Answer 16

The parallelism among instructions

Answer 17

- a sequence of instructions with a single entry point (the first instruction) and a single exit point (the last instruction)
- blocks execute sequentially within themselves
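One common way to find blocks that fit this definition is to mark "leaders": the first instruction, every branch target, and every instruction that follows a branch. The sketch below assumes a made-up instruction format, `(op, target)`, where `target` is the index of the branch destination or `None` for non-branches, and assumes every target is a valid index. The function name `basic_blocks` is also an assumption.

```python
def basic_blocks(instrs):
    """Split a list of (op, target) into blocks of instruction indices."""
    if not instrs:
        return []
    leaders = {0}  # the first instruction always starts a block
    for i, (op, target) in enumerate(instrs):
        if target is not None:           # this instruction is a branch
            leaders.add(target)          # its target starts a block
            if i + 1 < len(instrs):
                leaders.add(i + 1)       # so does the instruction after it
    starts = sorted(leaders)
    ends = starts[1:] + [len(instrs)]
    return [list(range(s, e)) for s, e in zip(starts, ends)]


program = [
    ("add", None),  # 0
    ("cbz", 3),     # 1: conditional branch to instruction 3
    ("sub", None),  # 2
    ("add", None),  # 3
    ("b", 0),       # 4: unconditional branch back to instruction 0
]
print(basic_blocks(program))
```

Each resulting block can only be entered at its first index and only left after its last, so a scheduler may reorder freely within it as long as dependences are respected.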

Answer 18

extra resources used to support non-essential tasks (function calls, etc.) that manage execution of the task but don't influence the result of the intended computation

Answer 19

the final version of code that has been optimized for performance

Answer 20

- an approach where the compiler or processor guesses the outcome of an instruction to remove it as a dependence in executing other instructions
- ex. assuming a branch is taken so that the instructions that come after it can be executed earlier

Answer 21

- the positions from which instructions can issue in a given clock cycle, allowing processors to fetch and dispatch multiple instructions per cycle
- which instruction goes in which slot may be determined statically by the compiler or dynamically by the processor
- a key task of multiple issue is determining which issue slots should be used for which instructions

Answer 22

- style of instruction set architecture that launches many operations that are defined to be independent in a single wide instruction
- typically has many separate opcode fields
- can think of an issue packet as one VLIW instruction

Answer 23

- A scheme whereby multiple instructions are launched in one clock cycle
- this allows the CPI to be less than 1

Answer 24

An approach to implementing a multiple-issue processor where many decisions are made by the compiler before execution

Answer 25

An approach to implementing a multiple-issue processor where many decisions are made during execution by the processor

Answer 26

when one instruction is launched per clock cycle

Answer 27

- the set of instructions that issues together in one clock cycle
- the packet may be determined statically by the compiler or dynamically by the processor
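As a rough sketch of how packets might be formed, the code below packs instructions in order into packets of a fixed width and closes a packet early when the next instruction depends on a register written in the current packet. The `(dest, sources)` format, the function name `pack`, and the width of 2 are assumptions. The model ignores latencies and structural limits such as "one ALU slot plus one memory slot".

```python
def pack(program, width=2):
    """Greedily group (dest, sources) instructions into in-order issue packets."""
    packets = []
    current, written = [], set()
    for i, (dest, sources) in enumerate(program):
        # An instruction cannot share a packet with one whose result it reads,
        # or with one that writes the same destination register.
        conflict = dest in written or any(s in written for s in sources)
        if len(current) == width or conflict:
            packets.append(current)
            current, written = [], set()
        current.append(i)
        written.add(dest)
    if current:
        packets.append(current)
    return packets


independent = [("x1", ["x0"]), ("x2", ["x0"]), ("x3", ["x1", "x2"]), ("x4", ["x0"])]
dependent = [("x1", ["x0"]), ("x2", ["x1"]), ("x3", ["x0"]), ("x4", ["x3"])]

for prog in (independent, dependent):
    packets = pack(prog)
    print(packets, "CPI =", len(packets) / len(prog))
```

In this simplified model one packet issues per cycle, so the first program reaches a CPI of 0.5 while the dependence chain in the second pushes it to 0.75.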

Answer 28

- backtracking takes time if the guess is wrong
- can introduce exceptions that would not otherwise have happened

Answer 29

- basically more hardware
- add more ports in register file for reading/writing
- add another adder to calculate addresses

Answer 30

An ordering forced by the reuse of a name, typically a register, rather than by a true dependence that carries a value between two instructions

Answer 31

- the renaming of registers by the compiler or hardware to remove antidependences (name dependences)
- used with loop unrolling
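A minimal sketch of the renaming idea: give every write a fresh register name and redirect later reads to the newest name. The `(dest, sources)` format, the `p0, p1, ...` naming scheme, and the function name `rename` are assumptions for illustration, and the unlimited supply of fresh names is a simplification.

```python
def rename(program):
    """Rename destinations in a list of (dest, sources) to fresh registers."""
    mapping = {}   # original register name -> most recent fresh name
    renamed = []
    for count, (dest, sources) in enumerate(program):
        # Reads use the latest name, so true dependences are preserved.
        new_sources = [mapping.get(s, s) for s in sources]
        new_dest = f"p{count}"  # fresh name: no earlier instruction uses it
        mapping[dest] = new_dest
        renamed.append((new_dest, new_sources))
    return renamed


# Instruction 1 overwrites x2, which instruction 0 still needs to read:
# an antidependence that forces an ordering without carrying any value.
program = [
    ("x1", ["x2"]),
    ("x2", ["x3"]),
    ("x4", ["x1", "x2"]),
]
print(rename(program))
```

After renaming, instruction 1 writes `p1` instead of `x2`, so it no longer has to wait for instruction 0 to read the old `x2`.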

Answer 32

number of clock cycles between a load instruction and an instruction that can use the result of the load without stalling the pipeline

Answer 33

- taken branches
- place BTB in IF stage and then fetch a target instruction as the actual next instruction
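A branch target buffer (BTB) can be pictured as a small table from branch address to target address that is consulted during fetch. The sketch below is heavily simplified: the class name, the unbounded dictionary, and the fixed 4-byte instruction size are assumptions, and real BTBs are finite tables indexed by low-order PC bits with tags.

```python
class BranchTargetBuffer:
    """Toy BTB: maps the address of a taken branch to its target address."""

    def __init__(self):
        self.table = {}

    def next_pc(self, pc):
        # Consulted in IF: on a hit, fetch the stored target next;
        # on a miss, fall through to the sequential instruction.
        return self.table.get(pc, pc + 4)

    def update(self, pc, taken, target):
        # Called once the branch outcome is actually known.
        if taken:
            self.table[pc] = target
        else:
            self.table.pop(pc, None)


btb = BranchTargetBuffer()
print(hex(btb.next_pc(0x40)))   # miss: sequential fetch
btb.update(0x40, taken=True, target=0x10)
print(hex(btb.next_pc(0x40)))   # hit: fetch the recorded target
```

If the prediction turns out wrong, the wrongly fetched instructions have to be flushed and the entry updated.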

4.8-4.10, 4.14-4.15 (57 cards)