Common Distributions Flashcards by O Cam

What is a Bernoulli random variable?

A discrete random variable that takes value 1 with probability p and 0 with probability 1 − p, modeling a single success/failure trial.

What is a typical use case for a Bernoulli distribution?

Modeling outcomes of yes/no events such as click vs no click, success vs failure, or positive vs negative label.

What are the parameters of a Bernoulli distribution?

A single parameter p, the probability of success (X=1).

What are the mean and variance of a Bernoulli(p) random variable?

Mean is p and variance is p(1 − p).
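
As a quick check of these two formulas, the sketch below computes the mean and variance directly from the two-point probability mass function. The function name `bernoulli_moments` and the value p = 0.3 are illustrative choices, not part of the original cards.

```python
def bernoulli_moments(p):
    """Return (mean, variance) of a Bernoulli(p) variable, computed from its pmf."""
    pmf = {0: 1 - p, 1: p}  # P(X=0) = 1 - p, P(X=1) = p
    mean = sum(x * prob for x, prob in pmf.items())
    variance = sum((x - mean) ** 2 * prob for x, prob in pmf.items())
    return mean, variance


# Illustrative value (an assumption): p = 0.3
mean, variance = bernoulli_moments(0.3)
print(mean, variance)  # should agree with p and p * (1 - p) up to rounding
```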

What is a Binomial random variable?

A discrete random variable that counts the number of successes in n independent Bernoulli trials with success probability p.

What are the parameters of a Binomial distribution?

Two parameters: n (number of trials) and p (success probability per trial).

What are the mean and variance of a Binomial(n,p) random variable?

Mean is np and variance is np(1 − p).
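
The same formulas can be checked numerically by summing over the Binomial probability mass function, P(X = k) = C(n, k) p^k (1 − p)^(n − k). This is a minimal sketch; the helper names and the values n = 10, p = 0.2 are assumptions made for illustration.

```python
from math import comb


def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def binomial_moments(n, p):
    """Return (mean, variance) by summing over the support 0..n."""
    probs = [binomial_pmf(k, n, p) for k in range(n + 1)]
    mean = sum(k * q for k, q in enumerate(probs))
    variance = sum((k - mean) ** 2 * q for k, q in enumerate(probs))
    return mean, variance


# Illustrative values (assumptions): 10 trials, success probability 0.2
n, p = 10, 0.2
mean, variance = binomial_moments(n, p)
print(mean, variance)  # should agree with n*p and n*p*(1-p) up to rounding
```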

What is a typical ML-related use case of the Binomial distribution?

Modeling counts of positive events in a fixed number of trials, such as number of clicks in n impressions.

What is a Geometric random variable?

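A discrete random variable that represents the number of independent Bernoulli(p) trials needed to get the first success, taking values 1, 2, 3, ... (some texts instead count the failures before the first success, starting at 0).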

What is the key property of the Geometric distribution?

The memoryless property: the probability of needing k more trials does not depend on how many failures have already occurred, i.e. P(X > m + k | X > m) = P(X > k).
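
A small numerical illustration of memorylessness, assuming the "number of trials until the first success" convention, so that P(X > k) = (1 − p)^k. The function names and the values m = 5, k = 3, p = 0.25 are hypothetical.

```python
def geometric_tail(k, p):
    """P(X > k): the first k trials are all failures."""
    return (1 - p) ** k


def conditional_tail(m, k, p):
    """P(X > m + k | X > m): k more failures, given m failures so far."""
    return geometric_tail(m + k, p) / geometric_tail(m, p)


# Illustrative values (assumptions): 5 failures so far, 3 more trials, p = 0.25
given_history = conditional_tail(5, 3, 0.25)
fresh_start = geometric_tail(3, 0.25)
print(given_history, fresh_start)  # the two values should match up to rounding
```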

What is a common use of the Geometric distribution?

Modeling the number of discrete trials until the first occurrence of an event, such as attempts until the first success or time steps until the first failure.

What is a Poisson random variable?

A discrete random variable that counts the number of events occurring in a fixed interval when events happen independently at a constant average rate.

What is the parameter of a Poisson distribution?

A single parameter λ, the average rate or expected count over the interval.

What are the mean and variance of a Poisson(λ) random variable?

Both mean and variance are equal to λ.

What is a typical ML-related use case for Poisson distributions?

Modeling count data such as number of events, calls, or arrivals per time window.

How is the Poisson distribution related to the Binomial distribution?

When the number of trials n is large and p is small with np fixed, a Binomial(n,p) can be approximated by a Poisson with λ = np.
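
To see the approximation at work, the sketch below holds λ = np fixed and measures the largest pointwise gap between the two probability mass functions as n grows. The helper `max_pmf_gap`, the cutoff `k_max`, and the chosen values of n and λ are assumptions for illustration only.

```python
from math import comb, exp, factorial


def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def poisson_pmf(k, lam):
    """P(X = k) for X ~ Poisson(lam)."""
    return exp(-lam) * lam**k / factorial(k)


def max_pmf_gap(n, lam, k_max=20):
    """Largest |Binomial(n, lam/n) pmf - Poisson(lam) pmf| over k = 0..k_max."""
    p = lam / n
    return max(
        abs(binomial_pmf(k, n, p) - poisson_pmf(k, lam)) for k in range(k_max + 1)
    )


# Hold lam = n * p = 2 fixed while n grows (illustrative values).
for n in (20, 100, 10_000):
    print(n, max_pmf_gap(n, 2.0))  # the gap should shrink as n increases
```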

What is a discrete uniform distribution on a finite set?

A distribution where each of a finite set of k outcomes has equal probability 1/k.

What is a continuous uniform distribution on an interval [a,b]?

A distribution where the random variable is equally likely to take any value in [a,b], with constant density 1/(b − a).

What is the mean of a Uniform(a,b) continuous distribution?

The mean is (a + b)/2.

What is the variance of a Uniform(a,b) distribution?

The variance is (b − a)² / 12.
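
The Uniform(a,b) mean and variance can be sanity-checked by integrating against the constant density 1/(b − a). The sketch below uses a simple midpoint rule; the function name, the step count, and the interval [2, 8] are assumptions chosen for illustration.

```python
def uniform_moments(a, b, steps=100_000):
    """Approximate (mean, variance) of Uniform(a, b) with a midpoint Riemann sum."""
    width = (b - a) / steps
    density = 1 / (b - a)  # constant density on [a, b]
    midpoints = [a + (i + 0.5) * width for i in range(steps)]
    mean = sum(x * density * width for x in midpoints)
    variance = sum((x - mean) ** 2 * density * width for x in midpoints)
    return mean, variance


# Illustrative interval (an assumption): [2, 8]
mean, variance = uniform_moments(2.0, 8.0)
print(mean, variance)  # should be close to (a + b)/2 = 5 and (b - a)^2 / 12 = 3
```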

What is a Normal (Gaussian) random variable?

A continuous random variable with a symmetric, bell-shaped density centered at its mean μ, with spread determined by its variance σ².

What are the parameters of a Normal distribution?

Mean μ and variance σ² (or equivalently standard deviation σ).

What are the mean and variance of a Normal(μ,σ²) distribution?

Mean is μ and variance is σ² by definition.

Why is the Normal distribution important in ML and statistics?

Because sums and averages of many independent effects are approximately normally distributed (by the central limit theorem), and many models assume Gaussian noise or priors.

What is a standard Normal distribution?

A Normal distribution with mean 0 and variance 1, often denoted Z ~ N(0,1).

How can any Normal(μ,σ²) variable be standardized to a standard Normal?

By subtracting the mean and dividing by the standard deviation: Z = (X − μ)/σ.
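
A minimal sketch of standardization using Python's standard-library `statistics.NormalDist`. The numbers (μ = 100, σ = 15, x = 130) are made up for illustration; the point is that the probability computed on the original scale matches the one computed on the standard Normal scale.

```python
from statistics import NormalDist


def standardize(x, mu, sigma):
    """Map a value of X ~ Normal(mu, sigma^2) to its z-score."""
    return (x - mu) / sigma


# Illustrative values (assumptions): mu = 100, sigma = 15, observed x = 130
mu, sigma, x = 100.0, 15.0, 130.0
z = standardize(x, mu, sigma)

# P(X <= x) on the original scale equals P(Z <= z) on the standard scale.
p_direct = NormalDist(mu, sigma).cdf(x)
p_standard = NormalDist(0.0, 1.0).cdf(z)
print(z, p_direct, p_standard)
```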

What is the Exponential distribution?

A continuous distribution describing the time between events in a Poisson process, with density decreasing exponentially.
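
The link to the Poisson process can be made concrete: waiting longer than t for the next event is the same as seeing zero events in a window of length t, so P(T > t) = e^(−λt) equals the Poisson(λt) probability of a zero count. The sketch below checks this; the function names and the values λ = 1.5, t = 2 are illustrative assumptions.

```python
from math import exp, factorial


def poisson_pmf(k, mean):
    """P(N = k) for N ~ Poisson(mean)."""
    return exp(-mean) * mean**k / factorial(k)


def exponential_survival(t, rate):
    """P(T > t) for T ~ Exponential(rate)."""
    return exp(-rate * t)


# Illustrative values (assumptions): 1.5 events per unit time, window of length 2
rate, t = 1.5, 2.0
p_no_event = poisson_pmf(0, rate * t)          # zero events in [0, t]
p_wait_longer = exponential_survival(t, rate)  # first event arrives after t
print(p_no_event, p_wait_longer)  # the two probabilities should coincide
```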

What is the parameter of an Exponential distribution?

Rate parameter λ > 0, where mean waiting time is 1/λ.

What are the mean and variance of an Exponential(λ) distribution?

Mean is 1/λ and variance is 1/λ².

What key property does the Exponential distribution share with the Geometric distribution?

The memoryless property: future waiting time does not depend on how long you have already waited.

What is a common modeling use of the Exponential distribution?

Modeling waiting times or lifetimes when events occur at a constant rate and independently over time.

What is a Gamma distribution (at a high level)?

A continuous distribution on the positive reals that generalizes the Exponential; for an integer shape k, it is the distribution of the sum of k independent Exponential waiting times with a common rate.

What are typical parameters of a Gamma distribution?

Shape parameter k (or α), together with either a scale parameter θ or a rate parameter λ = 1/θ (often written β); conventions vary.

What is a common use of the Gamma distribution in ML?

As a prior over positive quantities in Bayesian models, such as a Poisson rate or the precision (inverse variance) of a Normal.

What is a Beta distribution (at a high level)?

A continuous distribution defined on [0,1] with flexible shapes controlled by two parameters, often used to model probabilities.

What are the parameters of a Beta distribution?

Two positive shape parameters α and β.

Why is the Beta distribution often used as a prior for probabilities?

It is defined on [0,1], can represent many shapes (uniform, skewed, peaked), and is conjugate to the Bernoulli/Binomial likelihood.
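
Conjugacy means the posterior stays in the Beta family: starting from a Beta(α, β) prior and observing s successes and f failures gives a Beta(α + s, β + f) posterior. The sketch below illustrates that update; the function names, the uniform Beta(1, 1) prior, and the counts 7 and 3 are assumptions, and the mean formula α/(α + β) is a standard fact not stated on the cards.

```python
def beta_posterior(alpha, beta, successes, failures):
    """Update a Beta(alpha, beta) prior with Bernoulli/Binomial observations."""
    return alpha + successes, beta + failures


def beta_mean(alpha, beta):
    """Mean of a Beta(alpha, beta) distribution: alpha / (alpha + beta)."""
    return alpha / (alpha + beta)


# Illustrative data (assumptions): uniform prior Beta(1, 1), then 7 successes, 3 failures
posterior = beta_posterior(1.0, 1.0, successes=7, failures=3)
print(posterior, beta_mean(*posterior))
```

With a uniform prior, the posterior mean sits between the prior mean 0.5 and the observed success rate 0.7, moving toward the data as more observations arrive.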

What is a log-normal distribution?

A distribution where the logarithm of the variable is normally distributed.
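
Because log Y is Normal(μ, σ²), probabilities for a log-normal variable Y can be read off a Normal CDF evaluated at log y. The sketch below uses `statistics.NormalDist` for this; the function name and parameter values are illustrative assumptions. One consequence is that the median of Y is exp(μ).

```python
from math import exp, log
from statistics import NormalDist


def lognormal_cdf(y, mu, sigma):
    """P(Y <= y) where log(Y) ~ Normal(mu, sigma^2)."""
    if y <= 0:
        return 0.0  # a log-normal variable is strictly positive
    return NormalDist(mu, sigma).cdf(log(y))


# Illustrative parameters (assumptions): mu = 1.0, sigma = 0.5 on the log scale
mu, sigma = 1.0, 0.5
p_at_median = lognormal_cdf(exp(mu), mu, sigma)
print(p_at_median)  # half the probability mass lies below exp(mu)
```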

When might log-normal distributions appear in ML contexts?

For modeling positive, multiplicative quantities such as incomes, traffic, or response times that have right-skewed distributions.

Why are heavy-tailed distributions important to recognize?

They indicate higher probability of extreme values than the Normal, affecting risk, outlier handling, and model robustness.

What does it mean to choose a 'likelihood' in a statistical model?

Selecting a distribution family that describes how observed data are generated from underlying parameters or latent variables.

Why should the choice of distribution be guided by the data?

Using an inappropriate distribution can lead to biased estimates, poor uncertainty quantification, and mis-specified models.

What is the difference between parametric and nonparametric distributions (roughly)?

Parametric distributions are described by a finite set of parameters (e.g., Normal with μ,σ²); nonparametric methods make fewer assumptions and allow more flexible shapes.

In one sentence, what is the role of common distributions in ML?

They provide building blocks for modeling noise, counts, waiting times, and probabilities, and underpin many loss functions and inference procedures.

(44 cards)