What is Attribute Inference (Model Inversion)?
Attack that infers sensitive input features from a model's outputs; works best with overfitting and high-fidelity outputs (probabilities/logits). Example (the Fredrikson et al. warfarin study): a warfarin-dosing model can be inverted to infer a patient's genotype.
What conditions make model inversion effective?
Overfitting, smooth decision boundaries, high-resolution outputs (probabilities/logits), strong correlation between features and predictions.
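A minimal sketch of the inversion idea, using a made-up linear "dosing" model and an invented 0/1/2 genotype encoding: the attacker knows the non-sensitive features and the model's output, and simply searches over candidate values of the sensitive feature for the one that best reproduces the output.

```python
import numpy as np

# Toy "dose" model: output depends on a sensitive feature (genotype,
# encoded 0/1/2) plus known demographics. All weights are made up.
w = np.array([1.5, 0.3, -0.2])  # [genotype, age, weight] coefficients

def model(x):
    return float(w @ x)  # scalar prediction (e.g. a warfarin dose)

def invert_genotype(known_features, observed_output):
    # Enumerate candidate values of the sensitive feature and keep the
    # one whose prediction best matches the observed output.
    candidates = [0, 1, 2]
    errors = [abs(model(np.array([g, *known_features])) - observed_output)
              for g in candidates]
    return candidates[int(np.argmin(errors))]

true_x = np.array([2, 0.5, 0.8])       # true genotype = 2
y = model(true_x)                      # the attacker observes this output
print(invert_genotype([0.5, 0.8], y))  # recovers 2
```

High-fidelity outputs matter here: with only a coarse class label instead of the exact score, several candidates would tie and the search would fail.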
What is Property Inference?
Attack that learns global properties of the training set (e.g., most users wear glasses, dataset contains celebrities). Does NOT recover individuals.
How does Property Inference work?
Train shadow models on datasets with/without property P; meta-classifier predicts whether target model’s training data had P.
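The shadow-model pipeline can be sketched with synthetic data. Everything below is invented for illustration: the "shadow models" are least-squares linear fits, the property P is "feature 1 correlates with the label", and the meta-classifier is just a threshold on one learned weight.

```python
import numpy as np

rng = np.random.default_rng(0)

def train_shadow(has_property):
    # Property P: feature 1 correlates with the label in the training data.
    n = 200
    y = rng.integers(0, 2, n).astype(float)
    x0 = y + rng.normal(0, 1, n)                 # always predictive
    x1 = y + rng.normal(0, 1, n) if has_property else rng.normal(0, 1, n)
    X = np.column_stack([x0, x1])
    w, *_ = np.linalg.lstsq(X, y, rcond=None)    # shadow "model" = its weights
    return w

# Meta-training set: shadow-model weights labeled by whether P held.
labels = np.array([True, False] * 50)
W = np.array([train_shadow(p) for p in labels])

# Trivial meta-classifier: P held iff the weight on feature 1 is large.
threshold = (W[labels, 1].mean() + W[~labels, 1].mean()) / 2
target_w = train_shadow(True)        # pretend these are the target's weights
print(target_w[1] > threshold)       # meta-classifier says P is present
```

Note the attack's output is a *global* claim about the training set, not about any individual record.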
What is Model Extraction?
The attacker reconstructs a surrogate model f'(x) approximating the target f(x), typically via repeated queries, confidence scores, or decision-boundary exploration.
How is linear model extraction done?
Query n+1 affinely independent points in n-dimensional space, then solve the resulting linear system for the weights w and bias b.
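The n+1-query extraction is exact for a linear model. A sketch with an invented 3-dimensional secret model, querying the origin plus the standard basis vectors:

```python
import numpy as np

# Secret linear model f(x) = w·x + b that the attacker can only query.
w_secret = np.array([2.0, -1.0, 0.5])
b_secret = 3.0
f = lambda x: w_secret @ x + b_secret

# Query n+1 = 4 points in 3-D space and solve for [w, b].
n = 3
X = np.vstack([np.zeros(n), np.eye(n)])    # origin + basis vectors
y = np.array([f(x) for x in X])            # the n+1 query responses
A = np.column_stack([X, np.ones(n + 1)])   # augment with a bias column
w_b = np.linalg.solve(A, y)                # invertible 4x4 system
print(w_b)  # → [ 2.  -1.   0.5  3. ]
```

The origin query reveals b directly, and each basis-vector query then reveals one weight; any affinely independent point set works the same way.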
What helps model extraction succeed?
Access to confidence scores/logits; smooth or simple model structure; deterministic output.
What is Membership Inference?
Attack determining whether a specific record x was in the training set. Relies on overfitting and confidence differences.
What is the Membership Inference pipeline?
Shadow models → attack model trained on their outputs → classify target model’s output as member/non-member.
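In its simplest form the attack model reduces to a confidence threshold. A sketch with simulated confidences (the beta distributions below are invented; in the real pipeline the threshold would be calibrated on shadow-model outputs):

```python
import numpy as np

rng = np.random.default_rng(1)

# Simulate target-model confidences: an overfit model is more confident
# on training members than on unseen points (the gap the attack exploits).
member_conf = rng.beta(8, 2, 1000)       # skews toward 1.0
nonmember_conf = rng.beta(4, 4, 1000)    # centered near 0.5

threshold = 0.7                          # would come from shadow models

def infer_member(confidence):
    return confidence > threshold

tpr = infer_member(member_conf).mean()     # members flagged correctly
fpr = infer_member(nonmember_conf).mean()  # non-members wrongly flagged
print(f"TPR={tpr:.2f}, FPR={fpr:.2f}")     # attack beats random guessing
```

If the model did not overfit, the two confidence distributions would coincide and TPR ≈ FPR, i.e. the attack degrades to coin-flipping.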
What is Federated Learning?
Server sends model → clients train locally → send updates → server aggregates. Raw data stays local.
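The aggregation step is typically FedAvg: a weighted average of client parameters, weighted by local dataset size. A minimal sketch (update values are made up):

```python
import numpy as np

def fedavg(client_weights, client_sizes):
    # Weighted average of client model parameters, proportional to local
    # dataset size (FedAvg). Raw data never leaves the clients.
    sizes = np.asarray(client_sizes, dtype=float)
    stacked = np.stack(client_weights)
    return (stacked * sizes[:, None]).sum(axis=0) / sizes.sum()

# One round: server model -> local updates -> aggregation.
updates = [np.array([1.0, 2.0]), np.array([3.0, 4.0])]
print(fedavg(updates, [10, 30]))  # → [2.5 3.5]
```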
What are attack surfaces in FL?
A malicious server reading individual client gradients; gradient-leakage attacks reconstructing training inputs from updates; malicious clients poisoning the global model.
What is DSSGD (Selective Gradient Descent)?
Clients send only top-K gradients. Reduces leakage of small gradients but selection pattern still leaks info.
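The top-K selection itself is simple to sketch (gradient values invented):

```python
import numpy as np

def select_top_k(gradient, k):
    # DSSGD-style selective sharing: transmit only the k largest-magnitude
    # gradient entries; everything else stays on-device.
    idx = np.argsort(np.abs(gradient))[-k:]
    shared = np.zeros_like(gradient)
    shared[idx] = gradient[idx]
    return shared

g = np.array([0.05, -0.9, 0.3, 0.02, 0.7])
print(select_top_k(g, 2))  # → [ 0.  -0.9  0.   0.   0.7]
```

Note that both the transmitted values *and* which indices are nonzero are visible to the server, which is why the selection pattern itself can leak information.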
When does DSSGD fail?
When sensitive attributes heavily influence the largest gradients that are still transmitted.
What is Secure Aggregation?
Mechanism where users add pairwise noise (“antiparticles” +x/-x); noise cancels during aggregation so server sees only sum.
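A toy demonstration of pairwise masking with three clients (update values invented; a real protocol derives the masks from shared keys rather than a common RNG): each pair (i, j) with i < j shares a mask that i adds and j subtracts, so every individual upload looks random but the masks cancel in the sum.

```python
import numpy as np

rng = np.random.default_rng(2)

updates = [np.array([1.0, 2.0]), np.array([3.0, 4.0]), np.array([5.0, 6.0])]
n = len(updates)
# One shared mask per client pair (i, j), i < j.
masks = {(i, j): rng.normal(size=2) for i in range(n) for j in range(i + 1, n)}

masked = []
for u in range(n):
    m = updates[u].copy()
    for (i, j), mask in masks.items():
        if u == i:
            m += mask      # "+x" antiparticle
        elif u == j:
            m -= mask      # "-x" antiparticle
    masked.append(m)       # what the server actually receives

print(sum(masked))  # masks cancel, leaving only the true aggregate ≈ [9, 12]
```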
Pros of Secure Aggregation?
Strong privacy against malicious server; zero utility loss (noise cancels).
Cons of Secure Aggregation?
Protocol complexity; requires handling dropouts; involves peer-to-peer coordination.
What is Differential Privacy in FL?
Adds noise to protect individual updates. Only local DP (on-device) protects against malicious server; server-side DP does not.
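A sketch of the local-DP step on the client side: clip the update to bound its sensitivity, then add Gaussian noise *before* transmission, so even a malicious server only ever sees the noised vector. (The clip norm and sigma below are illustrative; in practice sigma is derived from the target epsilon/delta.)

```python
import numpy as np

rng = np.random.default_rng(3)

def local_dp_update(update, clip_norm, sigma):
    # Clip to bound per-client sensitivity, then add Gaussian noise
    # on-device. The noise is irreversible: this is where utility is lost.
    norm = np.linalg.norm(update)
    clipped = update * min(1.0, clip_norm / norm)
    return clipped + rng.normal(0, sigma * clip_norm, size=update.shape)

u = np.array([3.0, 4.0])                     # true local update, norm 5
noisy = local_dp_update(u, clip_norm=1.0, sigma=0.5)
print(noisy)                                 # what the server receives
```

Server-side DP would apply the same noise only *after* the server has already seen the raw updates, which is why it offers no protection against a malicious server.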
DP vs Secure Aggregation difference?
DP adds irreversible noise, reducing utility; secure aggregation preserves utility but requires a more complex protocol.
What is Fairness Through Blindness?
Removing protected attributes (race/gender) from inputs. Fails because proxy variables still encode them.
What is Statistical Parity?
Positive outcome probability should be equal across groups: P(positive|S) ≈ P(positive|S^c).
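The parity condition can be measured directly as a gap between group-wise positive rates (the predictions and group labels below are invented):

```python
import numpy as np

def statistical_parity_gap(y_pred, sensitive):
    # |P(positive | S) - P(positive | S^c)|; zero means exact parity.
    s = np.asarray(sensitive, dtype=bool)
    return abs(y_pred[s].mean() - y_pred[~s].mean())

y_pred = np.array([1, 1, 0, 1, 0, 0, 1, 0])   # model decisions
group = np.array([1, 1, 1, 1, 0, 0, 0, 0])    # protected-group membership
print(statistical_parity_gap(y_pred, group))  # → 0.5 (3/4 vs 1/4)
```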
Limitation of Statistical Parity?
Ignores correctness of predictions; can hide discriminatory error rates.
What is QII (Quantitative Input Influence)?
Causal transparency method: replace a feature with a random value drawn from its distribution in the population and measure the resulting change in the model's output.
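A toy sketch of the intervention: estimate a feature's influence as the fraction of random resamplings that flip the decision. The classifier and data below are invented; the toy model ignores feature 1 entirely, so its QII should be zero.

```python
import numpy as np

rng = np.random.default_rng(4)

def qii(model, X_population, x, feature):
    # QII for one feature: resample that feature from the population
    # (a causal intervention) and count how often the output changes.
    baseline = model(x)
    flips = 0
    for _ in range(500):
        x_int = x.copy()
        x_int[feature] = rng.choice(X_population[:, feature])
        flips += model(x_int) != baseline
    return flips / 500

# Toy classifier that depends only on feature 0.
model = lambda x: int(x[0] > 0)
X_pop = rng.normal(size=(1000, 2))
x = np.array([2.0, -1.0])
print(qii(model, X_pop, x, 0), qii(model, X_pop, x, 1))
```

Feature 0 flips the decision roughly half the time under resampling, while feature 1 never does, matching the model's actual (causal) use of its inputs.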
What questions does QII answer?
“Did gender change the decision?” or “Which feature mattered most for this prediction?”
What is memorization in GenAI?
LLMs can store rare or duplicated training sequences verbatim (k-eidetic memorization) and reveal this private data through prompting.