Object Recognition Flashcards

(37 cards)

1
Q

After visual information is processed by V1, where does it go? What type of visual information is processed here?

A

The extrastriate cortex, which contains multiple sub-areas (indicated by V and then a # larger than 1)
This is considered Mid-level Vision, or the “in-between” of more complex things

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

As we go from low-level vision (V1 and prior) to mid and high levels, what changes about the cells?

A

The types of info that cells are sensitive to becomes more complex
The size of space receptive fields cover generally increases as well

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the two general pathways for visual information after V1 and the extrastriate cortex? What are they each concerned with?

A

Dorsal (“where”): only concerned with detecting presence
Ventral (“what”): only concerned with detecting identity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

After V1, where does the “what” pathway go? How does this area process information differently than V1?

A

V2 - still works with contrast between light and dark, but has a better understanding of what this means
- Categorizes boundary ownership (can tell when edges are part of stand-alone things or a continuous object)
- Can tell foreground vs background
- Understands transparency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

After V2, where does the “what” pathway go? What does this region of the brain do and what level of vision does it process?

A

Inferotemporal Cortex - the region of the brain that knows what objects are
- deals with high-level (complex) vision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What would a lesion in the inferotemporal cortex lead to?

A

Agnosia: a deficit in the ability to identify objects even when one is able to see them
- have intact acuity, but impairments in grouping principles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do receptive fields in the inferotemporal cortex work? (What do they respond to?)

A

Respond less to specific parts of space and more to particular types of stimuli, with different subregions for specific objects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are a few of the different subregions in the inferotemporal cortex?

A

Fusiform Face Area: selective to faces
Parahippocampal Place Area: selective to scenes
Word Form Area: selective to the physical properties of words

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are grandmother cells? Why is there skepticism about their existence?

A

Hypothetical cells in the inferotemporal cortex that code a VERY specific stimulus, such as the face of a specific person
- we cannot sample all possible faces
- cells recode over time (don’t hold the same information forever)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

There are two different theories for how information is processed in the visual system, what are they?

A

Feed-Forward Process: a unidirectional form of exchange in which higher levels do not send feedback to lower ones

Reverse Hierarchy Theory: allows for feed-forward processing (for initial information) as well as top-down control (in order to alter processing and attend to particular details)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a heuristic and why are they necessary for object recognition?

A

A shortcut that our brain uses to make processing faster
- our visual system makes assumptions about the state of the world (emergent processing) in order to make us more efficient
- they’re not always 100% accurate, but work most of the time
- aren’t a conscious decision that can be “turned off”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Illusory Contours

A

Perceive a contrast even though the actual visual information doesn’t change between the two parts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Gestalt Processing and what are the different assumptions that are part of it?

A

The whole is greater than the sum of its parts, as in, our perception cannot be defined by the actual pieces of the visual world

Examples: closure, proximity, similarity, continuation, parallelism, symmetry, connectedness, and common region

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Closure

A

Assumption that objects are complete despite being obscured or occluded by other information
- nothing about the actual visual stimulus tells us that the figures close

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Proximity

A

Information that is relatively close together is grouped together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Similarity

A

Assumption that information of the same kind (shape, color, etc.) must group together

17
Q

Continuation

A

Assumption that lines, edges, and contours keep going in their general direction even when encountering other objects

18
Q

Parallelism and Symmetry

A

Parallel = moving in identical orientations
Symmetry = mirrored images of each other
Neither of these things typically occur unless they are part of the same object

19
Q

Connectedness and Common Region

A

Connectedness = it’s rare that objects be connected if they’re not part of the same system

Common Region = things enclosed in a shared space get grouped together

20
Q

What part of the “what” pathway sits between V2 and the inferotemporal cortex? What information does it process and understand?

A

V4: deals with more complex shapes, but not necessarily full objects
- may be particularly interested in curves and angles
- has “shape selectivity”, where cells respond more to certain patterns compared to others
- understands boundaries and occlusion (angles intrinsic to an object vs caused by occlusion)

21
Q

What does V4 assist with (theoretically)?

A

Might be the start of our visual system’s ability to detect object parts

22
Q

How does camouflage trick our visual assumptions?

A

Capitalizes on visual heuristics to disrupt Texture Segmentation - the visual system’s process of carving an image into regions based on texture properties

23
Q

What is Dazzle Camouflage?

A

Sort of like reverse camo, where you can definitely detect them, but it’s hard to identify edges, direction, and distance
Ex: zebras

24
Q

What is the purpose of camouflage?

A

Doesn’t have to be complicated or perfectly hide something, it’s meant to slow down object recognition to aid with survival

25
How does our perception compromise between competing factors? What are the "rules" that the "committee" uses to resolve information (principles of mid-level vision)?
- Groups and separates information in the visual world based on border ownership, texture, and the Gestalt principles - Uses information we already know - Assumes that viewpoints are not accidental - Seeks consensus to avoid ambiguity (picks an interpretation)
26
What are accidental viewpoints?
Viewing angles or positions that produce what appears to be a regularity in our vision, even though one is not actually present
27
How does our visual system use information we already know to make assumptions?
- Perceive things as abiding by the laws of physics - Perceive things the way they are most likely to appear - Assumes figure-ground relationships
28
What are the assumptions of figure-ground relationships?
- ground is larger in size - ground surrounds figure - things with high symmetry are generally figures
29
What is relatability and how does it relate to occlusion?
Assuming that two line segments that share the same slope or contour are part of a continuous whole (even if they are occluded by something else)
30
How does object recognition relate to memory?
Part of the reason we can recognize objects is because we remember previous things like them - need to be able to recognize things from different angles, contexts, and even when partially occluded
31
Template Model of Object Recognition (what are some strengths and weaknesses of it?)
A template is a visual pattern that is stored in memory and learned through experience - Assumes that you recognize an object by finding a matching template - Viewpoint matters Strength: fits well with the function of the inferotemporal cortex Weakness: we would have to have a ton of templates
32
Structural Model of Object Recognition (what are the strengths of it?)
Emphasize the importance of object components and their spatial relations - All shapes are made of basic components called geons - The geons present and the way they are arranged leads to the perception of different objects - Viewpoint invariant - Fits well with our understanding of V4
33
What are the different levels of object recognition (as it relates to vocabulary)? What can change the category being used?
Basic: something that shares a good number of common features ("bird") Specific: gaining a little bit of information about the object, but not necessarily enough to be more helpful in some cases ("robin") Global: losing a lot of information ("animal") Expertise can shift category use from basic to more specific
34
Why do basic/entrant level categories tend to be the most commonly used when labeling an object?
Basic vocabulary balances specificity (having enough common features to be a category) with generalizability (being broad enough for most people to understand what you mean)
35
What are some unique kinds of object recognition and the specific types of agnosia associated with them?
Faces and Letters/Words Prosopagnosia: facial recognition deficit Alexia: visual word recognition deficit
36
How are faces a unique kind of recognition?
- have a unique brain region (FFA) and special kind of agnosia - you can still discriminate between highly similar faces - holistic and configural processing (focusing on configuration; can't be explained by geons alone)
37
Facial Inversion Effect (what does this show us about facial processing)?
Process faces even if they are upside down and some of the components are wrong - Shows us that we process faces through the holistic processing of their configuration rather than just each piece in isolation