Exact Inference - 07 Flashcards

(33 cards)

1
Q

What is marginal inference?

A

We ask the question ‘What is the probability of a given variable in our model (possibly conditioned on evidence)?’.

2
Q

What is maximum a posteriori (MAP) inference?

A

We ask the question ‘What is the most likely assignment to the variables in the model (possibly conditioned on evidence)?’

3
Q

Inference is not challenging. True or False?

A

False

4
Q

Chains and trees are ______________. Loopy graphs are not.

A

tractable

5
Q

If a problem is intractable, we are still able to obtain useful answers via _______________ inference methods.

A

approximate

6
Q

Which are the methods for inference in graphical models?

A
  1. Variable elimination
  2. Belief propagation (aka message passing algorithms)
  3. Approximate inference methods (aka Monte Carlo techniques)
  4. Variational inference
7
Q

What is dynamic programming?

A

Dynamic programming "inverts" the order of computation, performing it inside out instead of outside in.
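As a supplement to this card, a minimal sketch of the "inside out" idea — the two small factor tables f1 and f2 are made up for illustration, not part of the deck:

```python
import itertools

# Two hypothetical pairwise factors over binary variables x1, x2, x3.
f1 = {(a, b): 0.5 + 0.1 * (a == b) for a in (0, 1) for b in (0, 1)}
f2 = {(b, c): 0.3 + 0.2 * (b != c) for b in (0, 1) for c in (0, 1)}

# "Outside in": enumerate the full joint, then sum (exponential in #variables).
naive = sum(f1[a, b] * f2[b, c]
            for a, b, c in itertools.product((0, 1), repeat=3))

# "Inside out": push the sum over c inward and cache the result per b,
# so the inner subexpression is computed once instead of once per (a, b).
m = {b: sum(f2[b, c] for c in (0, 1)) for b in (0, 1)}
dp = sum(f1[a, b] * m[b] for a in (0, 1) for b in (0, 1))

assert abs(naive - dp) < 1e-12
```

Both orders give the same number; only the amount of repeated work differs.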

8
Q

What are the 2 ideas that help us address the exponential blowup of the joint distribution in the variable elimination method?

A
  1. Because of the structure of the Bayesian network, some subexpressions in the joint depend only on a small number of variables.
  2. By computing these expressions once and caching the results, we can avoid generating them exponentially many times.
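As a supplement, a minimal variable-elimination sketch built on these two ideas — the `eliminate` helper and the tiny chain A → B → C with its made-up probability tables are hypothetical, for illustration only:

```python
from itertools import product

def eliminate(factors, var, domain=(0, 1)):
    """Sum out `var`: multiply the factors that mention it, then marginalize.

    Each factor is (scope_tuple, table_dict mapping assignments -> value).
    A sketch over binary variables, not an optimized implementation.
    """
    touching = [f for f in factors if var in f[0]]
    rest = [f for f in factors if var not in f[0]]
    # Scope of the new intermediate factor: everything touched except `var`.
    scope = tuple(sorted({v for s, _ in touching for v in s} - {var}))
    table = {}
    for assign in product(domain, repeat=len(scope)):
        env = dict(zip(scope, assign))
        total = 0.0
        for x in domain:
            env[var] = x
            p = 1.0
            for s, t in touching:
                p *= t[tuple(env[v] for v in s)]
            total += p
        table[assign] = total
    return rest + [(scope, table)]

# Hypothetical chain A -> B -> C: P(A), P(B|A), P(C|B).
pA = (('A',), {(0,): 0.6, (1,): 0.4})
pBA = (('A', 'B'), {(0, 0): 0.7, (0, 1): 0.3, (1, 0): 0.2, (1, 1): 0.8})
pCB = (('B', 'C'), {(0, 0): 0.9, (0, 1): 0.1, (1, 0): 0.5, (1, 1): 0.5})

factors = [pA, pBA, pCB]
for v in ('A', 'B'):          # eliminate A, then B, to obtain P(C)
    factors = eliminate(factors, v)
(scope, pC), = factors
```

Each call caches a whole intermediate factor, so no subexpression is ever recomputed.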
9
Q

Why does the order of elimination matter?

A

Different elimination orders → different intermediate factor sizes → different complexity.

10
Q

What is the naïve approach in the variable elimination method?

A

First, evaluate the joint distribution and then perform the summation explicitly.
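As a rough illustration of why the naïve approach fails, a back-of-the-envelope operation count — the sizes K = 2 and n = 20 are made-up assumptions:

```python
# Cost of marginalizing a chain of n variables with K states each:
# the naive approach builds the full joint (K**n entries), while eliminating
# variables one at a time along the chain touches only K*K entries per step.
K, n = 2, 20
naive_terms = K ** n           # full joint: over a million entries to sum
ve_terms = (n - 1) * K * K     # per-variable elimination: 76 entries in total
```

The gap between the two counts grows exponentially with n.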

11
Q

Define tree in an undirected graph.

A

In an undirected graph, a tree is defined as a graph in which there is one, and only one, path between any pair of nodes.

12
Q

Trees have loops. True or False?

A

False

13
Q

Define tree in a directed graph.

A

In directed graphs, a tree is defined such that there is a single node, called the root, which has no parents, and all other nodes have one parent.

14
Q

If there are nodes in a directed graph that have more than one parent, but there is still one path between any two nodes, then the graph is called a _____________.

Such a graph will have more than one node with the property of having ____ ___________, and the corresponding moralized undirected graph will have __________.

A

polytree; no parents; loops

15
Q

______________ algorithm is used for marginal inference.

_______________ algorithm is used for MAP inference.

A

Sum-product; Max-product
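As a supplement, a minimal sum-product sketch on a hypothetical three-variable chain, checked against brute-force enumeration (the potential tables psi12 and psi23 are made up for illustration):

```python
from itertools import product

# Hypothetical pairwise potentials on a chain x1 - x2 - x3 (binary states).
psi12 = [[1.0, 2.0], [3.0, 1.0]]
psi23 = [[2.0, 1.0], [1.0, 2.0]]

# Sum-product: the marginal of x2 is proportional to the product of the
# messages arriving from both ends of the chain.
msg_from_1 = [sum(psi12[x1][x2] for x1 in (0, 1)) for x2 in (0, 1)]
msg_from_3 = [sum(psi23[x2][x3] for x3 in (0, 1)) for x2 in (0, 1)]
p2 = [a * b for a, b in zip(msg_from_1, msg_from_3)]
Z = sum(p2)
p2 = [v / Z for v in p2]

# Brute force over the joint agrees with the message-passing marginal.
brute = [0.0, 0.0]
for x1, x2, x3 in product((0, 1), repeat=3):
    brute[x2] += psi12[x1][x2] * psi23[x2][x3]
Zb = sum(brute)
brute = [v / Zb for v in brute]
assert all(abs(a - b) < 1e-12 for a, b in zip(p2, brute))
```

Replacing the two sums with maxes (and working in log space) turns this same sketch into max-product/max-sum.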

16
Q

The marginal is proportional to the product of the incoming messages. True or False?

A

True

17
Q

In the sum-product algorithm, if the factor graph is derived from a directed graph, then the joint distribution is already correctly ________________, and so the marginals obtained will be normalized correctly.

If the factor graph is derived from an undirected graph, then we have the _______________ _________ Z and in this case we just need to marginalize over a single ____________ rather than the entire set of variables.

A

normalized; normalization factor; variable

18
Q

We can use the messages to compute the marginals of all variables in the graph. True or False?

A

True (provided every edge carries a message in both directions)

19
Q

If each edge has a message in both directions, we can compute the marginals of all variables in the graph. True or False?

A

True

20
Q

The correspondence between messages and effective __________ allows us to find the joint distribution for variables connected to the same factor node (_____________).

A

factors; neighbors

21
Q

From a leaf variable node to a factor node, the message is _____.

A

1

22
Q

From a leaf factor node to a variable node, the message is the _____.

A

factor

23
Q

In the sum-product algorithm, the factor graph must be a _______.

A

tree

24
Q

The sum-product algorithm can be used to compute _______________.

This can be viewed as defining a new __________ ________ on the non-evidential variables.

Or, one can keep the original factor graph, but for factor-to-variable messages, the sum is only taken over _______________ variables; any evidential variables in the potential are set to their evidential states.

A

conditionals; factor graph; non-evidential

25
Q

The sum-product algorithm may be used for ______________ random variables. All stays the same, but we replace sums with ___________. In special cases, the integral can be computed in closed form (e.g., the Gaussian family). If not, there is a need for ________________. ________________ are also needed for discrete random variables when K is _________.

A

continuous; integrals; approximations; approximations; large
26
Q

What do we need to do if our factor graph is not a tree?

A
  1. Group variables together so that the factor graph becomes a tree (called a junction tree or clique tree)
  2. Pretend the factor graph is a tree and use message passing anyway (loopy belief propagation)
27
Q

What is the max-sum algorithm?

A

The max-sum algorithm is the message passing algorithm obtained by replacing sums with maxes, products with sums, and factors with log-factors.
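As a supplement, a minimal max-sum sketch on a hypothetical chain (the pairwise potentials are made up for illustration), recovering the MAP assignment by backtracking:

```python
import math

# Hypothetical pairwise potentials on a chain x1 - x2 - x3 (binary states),
# moved to log space: sums of log-factors, max instead of sum.
psi12 = [[1.0, 2.0], [3.0, 1.0]]
psi23 = [[2.0, 1.0], [1.0, 2.0]]
log12 = [[math.log(v) for v in row] for row in psi12]
log23 = [[math.log(v) for v in row] for row in psi23]

# Forward max-sum messages along x1 -> x2 -> x3, keeping back-pointers.
m12 = [max(log12[x1][x2] for x1 in (0, 1)) for x2 in (0, 1)]
back1 = [max((0, 1), key=lambda x1: log12[x1][x2]) for x2 in (0, 1)]
m23 = [max(m12[x2] + log23[x2][x3] for x2 in (0, 1)) for x3 in (0, 1)]
back2 = [max((0, 1), key=lambda x2: m12[x2] + log23[x2][x3]) for x3 in (0, 1)]

# Backtrack from the best final state to recover the MAP assignment.
x3 = max((0, 1), key=lambda x: m23[x])
x2 = back2[x3]
x1 = back1[x2]
```

On a chain this is exactly the Viterbi-style recursion: forward maxes, then a backward pass through the stored argmaxes.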
28
Q

In the max-sum algorithm, from a leaf variable node to a factor node, the message is ____.

A

0
29
Q

In the max-sum algorithm, from a leaf factor node to a variable node, the message is the ________________ of the _________.

A

logarithm; factor
30
Q

The message passing framework can be generalized to arbitrary graph topologies, giving an exact inference procedure known as the _____________ ________ algorithm.

If the starting point is a directed graph, it is first converted to an undirected graph by ______________, whereas if starting from an undirected graph, this step is not required.

Next, the graph is ________________, which involves finding chordless cycles containing four or more nodes and adding extra links to eliminate them.

Finally, the triangulated graph is used to construct a new tree-structured undirected graph called a join tree (or clique tree), whose nodes correspond to the maximal _________ of the triangulated graph, and whose links connect pairs of cliques that have variables in common.

A

junction tree; moralization; triangulated; cliques
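As a supplement, the moralization step can be sketched in a few lines — the `moralize` helper and the tiny v-structure DAG are hypothetical, for illustration only:

```python
from itertools import combinations

def moralize(parents):
    """Moralize a DAG: "marry" the parents of each node, then drop directions.

    `parents` maps each node to its set of parents; returns undirected edges
    as frozensets.  A small sketch, not a full junction-tree construction.
    """
    edges = set()
    for child, ps in parents.items():
        for p in ps:                      # keep original edges, undirected
            edges.add(frozenset((p, child)))
        for a, b in combinations(sorted(ps), 2):
            edges.add(frozenset((a, b)))  # connect co-parents
    return edges

# v-structure a -> c <- b: moralization adds the extra a - b link.
dag = {'a': set(), 'b': set(), 'c': {'a', 'b'}}
```

Triangulation and clique extraction would follow on the resulting undirected graph.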
31
Q

Understanding approximate inference gives a deeper context for why and how exact inference was needed. True or False?

A

False. It's the opposite: understanding exact inference gives a deeper context for why and how approximate inference is needed.
32
Q

Distinguish variable elimination and belief propagation.

A

Variable elimination: Bayesian/Markov network; expensive for many queries; exact for all DAGs; difficult to parallelize.

Belief propagation: factor graph; reuses computation via messages; exact only for trees, approximate for loopy graphs; naturally parallelizable.
33
Q

A variable is conditionally independent of everything else given its Markov blanket. For a factor graph, the Markov blanket of a variable x is all ________ nodes adjacent to x, and all ___________ nodes adjacent to those _________.

A

factor; variable; factors
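A small sketch of this definition — the `markov_blanket` helper and the two-factor chain below are hypothetical, for illustration only:

```python
def markov_blanket(x, factor_scopes):
    """Markov blanket of variable x in a factor graph.

    `factor_scopes` maps each factor name to the set of variables it touches.
    The blanket is every variable sharing a factor with x (x itself excluded).
    """
    adjacent_factors = [f for f, scope in factor_scopes.items() if x in scope]
    return {v for f in adjacent_factors for v in factor_scopes[f]} - {x}

# Hypothetical chain a - f1 - b - f2 - c: the blanket of b is {a, c}.
scopes = {'f1': {'a', 'b'}, 'f2': {'b', 'c'}}
```

Conditioning on that set renders x independent of every remaining variable.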