18-GPU-Atomic-Operations Flashcards

Question 1

Q

What is an atomic operation?

Answer

A

An operation that completes without interruption, so other threads see it as instantaneous

Question 2

Q

What is the load-compute-store problem?

Answer

A

Multiple threads read, modify, and write the same variable without synchronization, causing race conditions
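
As a rough illustration of the load-compute-store problem, the sketch below replays one bad interleaving by hand in plain C. The function name lost_update_demo and the two "work-items" A and B are hypothetical; nothing here runs concurrently, the statements are simply ordered the way an unlucky schedule might order them.

```c
/* Hypothetical helper: replays the interleaving in which two work-items
   both load the counter before either one stores its result. */
int lost_update_demo(void) {
    int counter = 0;   /* the shared variable */

    int a = counter;   /* work-item A: load    (reads 0) */
    int b = counter;   /* work-item B: load    (reads 0) */
    a = a + 1;         /* work-item A: compute (1) */
    b = b + 1;         /* work-item B: compute (1) */
    counter = a;       /* work-item A: store   (counter is 1) */
    counter = b;       /* work-item B: store   (counter is still 1) */

    return counter;    /* one increment was lost */
}
```

Two increments ran, yet the counter ends at 1 because B overwrote A's result with a value computed from stale data.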

Question 3

Q

What is the histogram example?

Answer

A

Counting frequency of values - multiple threads incrementing same counters

Question 4

Q

What is the problem with naive histogram?

Answer

A

Race conditions cause lost increments

Question 5

Q

What is the solution?

Answer

A

Use atomic_inc() for thread-safe increments
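
A minimal host-side sketch of the fix, assuming C11 atomics as a stand-in for OpenCL: atomic_fetch_add(p, 1) plays the role of atomic_inc(p). The names HIST_BINS, histogram_add, and histogram_demo are hypothetical, and the input values are assumed to be non-negative.

```c
#include <stdatomic.h>
#include <stddef.h>

#define HIST_BINS 4   /* assumed bin count for this sketch */

/* One call stands in for one work-item processing one input value.
   The increment is a single indivisible read-modify-write, so
   concurrent callers cannot lose updates. */
void histogram_add(atomic_int *bins, int value) {
    atomic_fetch_add(&bins[value % HIST_BINS], 1);
}

/* Builds a histogram of five sample values and checks every bin. */
int histogram_demo(void) {
    const int data[5] = {0, 1, 1, 3, 1};
    atomic_int bins[HIST_BINS];
    for (int b = 0; b < HIST_BINS; b++) atomic_init(&bins[b], 0);
    for (size_t i = 0; i < 5; i++) histogram_add(bins, data[i]);
    return atomic_load(&bins[0]) == 1 && atomic_load(&bins[1]) == 3 &&
           atomic_load(&bins[2]) == 0 && atomic_load(&bins[3]) == 1;
}
```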

Question 6

Q

What are OpenCL atomic operations?

Answer

A

atomic_inc, atomic_dec, atomic_add, atomic_min, atomic_max, etc.

Question 7

Q

What is atomic_inc(p)?

Answer

A

Atomically increments *p, returns old value
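
One common use of the returned old value is handing out unique indices. The sketch below assumes C11 atomics on the host, where atomic_fetch_add(p, 1) has the same "return the previous value" behavior; reserve_slot and reserve_demo are hypothetical names.

```c
#include <stdatomic.h>

/* Hypothetical helper: reserves one output slot and returns its index.
   The value returned is the counter as it was before the increment,
   so no two callers can receive the same index. */
int reserve_slot(atomic_int *next_free) {
    return atomic_fetch_add(next_free, 1);
}

int reserve_demo(void) {
    atomic_int next_free;
    atomic_init(&next_free, 0);
    int first = reserve_slot(&next_free);    /* old value 0 */
    int second = reserve_slot(&next_free);   /* old value 1 */
    return first == 0 && second == 1 && atomic_load(&next_free) == 2;
}
```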

Question 8

Q

What is atomic_cmpxchg(p

Question 9

Q

What is the spinlock pattern?

Answer

A

while (atomic_cmpxchg(&lock, 0, 1) == 1) { } - busy-wait until the lock is acquired
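
A host-side sketch of the same pattern, assuming C11 atomics. One difference from OpenCL is worth noting: atomic_cmpxchg returns the old value, whereas C11's atomic_compare_exchange_strong returns true on success and writes the old value into its second argument. The names demo_spinlock, spin_try_lock, spin_lock, and spin_unlock are hypothetical.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* 0 = unlocked, 1 = locked, matching the convention on the card. */
typedef atomic_int demo_spinlock;

/* One attempt: succeeds only if the lock held 0, and sets it to 1. */
bool spin_try_lock(demo_spinlock *lock) {
    int expected = 0;
    return atomic_compare_exchange_strong(lock, &expected, 1);
}

void spin_lock(demo_spinlock *lock) {
    while (!spin_try_lock(lock)) { /* busy-wait */ }
}

void spin_unlock(demo_spinlock *lock) {
    atomic_store(lock, 0);
}

int spinlock_demo(void) {
    demo_spinlock lock;
    atomic_init(&lock, 0);
    spin_lock(&lock);                          /* free, so acquired at once */
    bool while_held = spin_try_lock(&lock);    /* fails: already held */
    spin_unlock(&lock);
    bool after_release = spin_try_lock(&lock); /* succeeds again */
    return !while_held && after_release;
}
```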

Question 10

Q

What is the problem with spinlocks on GPU?

Answer

A

A lock in global memory causes heavy contention, and spinning threads cause divergence issues

Question 11

Q

What is lock-free programming?

Answer

A

Thread-safe data structures without locks, using atomics

Question 12

Q

What is the linked list prepend problem?

Answer

A

Multiple threads adding to list head simultaneously

Question 13

Q

What is the lock-free solution?

Answer

A

Use atomic_cmpxchg in loop to update head pointer atomically
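
The compare-and-swap loop might look roughly like this on the host, assuming C11 atomics in place of atomic_cmpxchg. The list_node type and the list_prepend and prepend_demo functions are hypothetical; the demo calls them from one thread only, to show the resulting links.

```c
#include <stdatomic.h>
#include <stddef.h>

struct list_node {
    int value;
    struct list_node *next;
};

/* Prepends n to the list whose atomic head pointer is *head. */
void list_prepend(struct list_node *_Atomic *head, struct list_node *n) {
    struct list_node *old = atomic_load(head);
    do {
        n->next = old;   /* link to the head we last observed */
        /* If another thread changed the head in the meantime, the
           exchange fails, old is refreshed, and the loop retries. */
    } while (!atomic_compare_exchange_weak(head, &old, n));
}

int prepend_demo(void) {
    struct list_node *_Atomic head;
    atomic_init(&head, NULL);
    struct list_node a = {1, NULL}, b = {2, NULL};
    list_prepend(&head, &a);
    list_prepend(&head, &b);
    struct list_node *first = atomic_load(&head);
    return first == &b && first->next == &a && a.next == NULL;
}
```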

Question 14

Q

What is optimistic concurrency?

Answer

A

Assume operation succeeds, retry if conflict detected
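
As a sketch of the retry idea, the code below builds an atomic multiply, an operation with no dedicated built-in, from compare-and-swap. It assumes C11 atomics on the host; cas_multiply and cas_multiply_demo are hypothetical names.

```c
#include <stdatomic.h>

/* Atomically replaces *p with *p * factor and returns the old value. */
int cas_multiply(atomic_int *p, int factor) {
    int seen = atomic_load(p);
    /* Optimistic step: compute from the value we saw and commit only if
       *p still holds it. On a conflict, seen is reloaded with the
       current value and the product is recomputed on the next pass. */
    while (!atomic_compare_exchange_weak(p, &seen, seen * factor)) { }
    return seen;
}

int cas_multiply_demo(void) {
    atomic_int x;
    atomic_init(&x, 3);
    int old = cas_multiply(&x, 4);
    return old == 3 && atomic_load(&x) == 12;
}
```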

Question 15

Q

What is the trade-off with atomics?

Answer

A

Lower overhead than locks, but limited to simple operations

Question 16

Q

When to use atomics vs locks?

Answer

A

Atomics for simple operations, locks for complex multi-statement sections

Question 17

Q

What is the histogram optimization?

Answer

A

Build a histogram in local memory per work group, then atomically add it to the global histogram

Question 18

Q

Why use local memory for histogram?

Answer

A

Reduces contention on global memory locations

Question 19

Q

What is the local histogram pattern?

Answer

A

Each work group builds local histogram, then atomically adds to global
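
A sequential host-side sketch of this pattern, assuming C11 atomics. Each call to group_histogram stands in for one work group, and a plain array stands in for local memory. On a real device the work-items of a group run concurrently, so the local increments would themselves need to be atomic. LOCAL_BINS, GROUP_SIZE, and the function names are hypothetical, and input values are assumed to be non-negative.

```c
#include <stdatomic.h>
#include <stddef.h>

#define LOCAL_BINS 4   /* assumed bin count */
#define GROUP_SIZE 4   /* assumed work-group size */

/* One "work group": count a slice privately, then merge into global. */
void group_histogram(const int *slice, size_t n, atomic_int *global_bins) {
    int local_bins[LOCAL_BINS] = {0};   /* stand-in for local memory */
    for (size_t i = 0; i < n; i++) local_bins[slice[i] % LOCAL_BINS]++;
    /* At most one global atomic per bin per group, not one per value. */
    for (int b = 0; b < LOCAL_BINS; b++)
        if (local_bins[b] != 0)
            atomic_fetch_add(&global_bins[b], local_bins[b]);
}

int local_histogram_demo(void) {
    const int data[8] = {0, 1, 1, 3, 1, 2, 3, 3};
    atomic_int global_bins[LOCAL_BINS];
    for (int b = 0; b < LOCAL_BINS; b++) atomic_init(&global_bins[b], 0);
    for (size_t g = 0; g < 8; g += GROUP_SIZE)
        group_histogram(&data[g], GROUP_SIZE, global_bins);
    return atomic_load(&global_bins[0]) == 1 &&
           atomic_load(&global_bins[1]) == 3 &&
           atomic_load(&global_bins[2]) == 1 &&
           atomic_load(&global_bins[3]) == 3;
}
```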

Question 20

Q

What is the performance benefit?

Answer

A

Less contention on global memory, and local memory is faster than global memory

Question 21

Q

What is the key insight about GPU atomics?

Answer

A

Essential for coordinating access to shared resources like histograms

(21 cards)