Perception Flashcards

Question

What is template matching?

Answer 1

It's a theory about how our brain stores a template and we recognize things by matching what we see to that template

Answer 2

It is an area in the temporal lobe that becomes active when we see faces

Answer 3

They are the smallest units of sound in a language. E.g. “bat” consist of 3 /b/, /a/, and /t/

Answer 4

It’s the property of a sound that makes it behave like a consonant

Answer 5

Cells in the visual cortex which respond positively to light in the center and negatively to light in the periphery or vice versa.

Answer 6

Cells in the visual cortex which respond positively to light on one side of a line and negatively to light on the other side or vice versa.

Answer 7

This sketch allows us to identify where objects are in space, relative to the viewer. It employs cues such as texture gradient, stereopsis and motion parallax.

Answer 8

The late stage representation of objects in the visual field. It identifies how parts go together to form an image of the objects we see. Gestalt principles take us to a 3-D model.

Answer 9

In the early stage of perception, features are extracted to make initial sense of the information and thereby make a primal sketch.

Answer 10

Impairment in recognition of objects presented visually. Two types: apperceptive agnosia and associative agnosia

Answer 11

Located behind the pupil. Actively focuses or bends light as it enters the eye so it falls on the fovea (also known as accommodation).

Answer 12

Photoreceptor cells in the retina of the eye that aid in night vision and motion detection. They provide poor colour vision and low resolution, resulting in lower visual acuity.

Answer 13

The opening in the middle of the iris. Changes size to allow different amount of light to enter the eye.

Answer 14

The processing of a stimulus in which information from the physical stimulus, rather than from general expectations/knowledge, is used to help recognise the stimulus. That is, information travels 'up' from the stimuli, via the senses, to the brain.

Answer 15

An illusion where a central circle appears larger or smaller depending on the size of the surrounding circles, despite the central circle’s actual size remaining constant.

Answer 16

In the early phase, shapes and objects are extracted from the visual scene. In the later phase, shapes and objects are recognised.

Answer 17

Elements tend to appear more closely packed together as the distance from the view increases.

Answer 18

The ability to perceive 3-D because two eyes receive slightly different view of the world.

Answer 19

Provides 3D information when an object is in motion. As more distant points move they will move more slowly across the retina than closer points.

Answer 20

- Illustrates the phoneme-restoration effect (good evidence exists for the role of context in the perception of speech) - Participants listened to “The state governors met with their respective legislatures convening in the capital city,” with a 120-ms tone replacing the middle s in legislatures - Only 1 in 20 participants reported hearing the pure tone, but couldn’t locate it correctly

Answer 21

Study that showed that people are much better at recognizing faces presented in upright orientation than other categories of objects, such as houses in same orientation. But when faces and object presented upside down, there is a dramatic decrease in its recognition, but not in the same way for other objects

Answer 22

- Study of the ability to detect changes in people’s faces - They found greater activation in the fusiform gyrus, when changes were detected, than when they were not

Answer 23

- They explain why people are more accurate when identifying the letter in the word context (see figure 2.26) - In figure 2.26.a, if only shown the very last letter or the first three letters, the participant wouldn’t be able to intuitively guess correct. Only when combining the two it becomes clear that the whole word must be WORK. It’s an unconscious inference. Same as in figure 2.26.b where multiple letters are unrecognizable, but when combining the fragments only one word is possible

Answer 24

- Participants were presented with either a letter (such as D) or a word (such as WORD). Then given a pair of alternatives and had to tell which alternative they had seen. E.g. If shown letter D, then D and K could be alternatives. If shown the word “WORD”, then WORD and WORK could be alternatives. Both scenarios differed only by the letter D or K - Results: 10% more accuracy in identifying the word than letter alone. Better discrimination between D and K better in the context of a word than as letters alone

Answer 25

- a patient with associative agnosia ( visual agnosia with regards to object recognition) was presented with a drawing of an anchor and was supposed to recreate it - patient was able to recreate the drawing relatively, but couldn't recognize it (he said it was an umbrella) - suggests that people with associative agnosia don't struggle with early processing, but with pattern recognition, that occurs later

Answer 26

- a soldier suffered brain damage, which resulted in a visual agnosia - he was able to recognize objects by their feel, smell, and sound, but couldn't distinguish a circle and a square from each other, recognize letters or faces, he could, however, distinguish colors from each other or tell in what direction an object is moving - suggests that there is a part of a brain responsible for transforming visual information into perceptual experience, another part of the brain is responsible for sensory information; visual perception is more than just "seeing"

Answer 27

- a patient with visual agnosia, due to damage in temporal lobe (and no damage in parietal lobe), could correctly reach out and grasp the door handle, even though she couldn't recognize the object - suggests that the "where" pathway (neural pathway carrying visual information from primary visual cortex to parietal lobe) is specialized in action

Answer 28

- study on a cat's primary visual cortex - they found differently configured receptive fields in primary visual cortex than in ganglion cells and cells in the lateral geniculate nucleus - they found cells that are elongated, in contrast to circular receptive fields of the on-off and off-on cells; there are edge detectors and bar detectors

Answer 29

- study on the primary visual cortex - it was found that besides cells that respond to particular patterns in particular locations, there are also cells that respond to particular patterns in many locations

Answer 30

- study on a macaque monkey - they found a neuron in the inferior temporal lobe of the monkey's brain that would only fire up when the monkey was shown a picture of a hand or something highly resembling a hand, while seeming somewhat insensitive to the hand location - this study suggests that there are cells that respond to particular complex patterns (like hands or faces) in many locations

Answer 31

Light energy - primal sketch - 2 1/2 - D sketch - 3 - D model - recognized objects

Answer 32

Gibson (1969) proposed how the features underlying the recognition of letters is divided. E.g. the capital letter A can be seen as consiting of a horizontal, two diagonals in opposite orientations, line intersection, symmetry and "vertical discontinuity". In comparison to the template model, some of the advantages are, simpler features making it easier to correct for the difficulties of the template model in recognizing full patterns Also highlights the most important parts of the relationships to the pattern. For example with A being three lines that intersect, two diagonals and one horizontal. The feature template also eliminates the need for a large amount of templates.

Answer 33

In relation to evidence for the existence of features as componets in pattern recognition. Kinney et al. looked into the "open H", which is when letters share features, people are more prone to confuse them. Participants were briefly exposed to a letter, and then tested to see what letter they saw. What they found out was, that their participants made 29 errors when presented with the letter G, 21 of the misclassifications were the letter C, 6 of them were the letter O, 1 misclassification was B, and 1 misclassification was 9. No other errors occured. Conclusion is, that participants were more often misclassfying the letter G with the letters sharing similar features with it.

Answer 34

Looked into feature-analysis Looked into psychological nystagmus which is the small drift that happens with a rate of 30 to 70 cycles per seconds in the eye, and how it ties in together with perception. What they specifically tested was disintegration of an image that is stabilized on the eye. What they found out, was that that the partial outlines show various patterns reported as the stabilized image began to disappear.

Answer 35

Made a deep convolutional network for object recognition In the 8-layerd model, image processing starts with the stimulus, then followed by five layers of pattern recognizers. Elements in small regions of pixels converge on elements in layer 1. Layer 1 elements converge on layer 2, then elements that recognize patterns of layer 1 elements that recognize patterns continue onto layer 3 and so on.

Answer 36

Made a deeper deep convolutional network than Krizhevsky et al. with 150 layers instead of the 8-layerd model for object recognition. Works by elements in small regions of pixels converge on elements in layer 1. Layer 1 elements converge on layer 2, then elements that recognize patterns of layer 1 elements that recognize patterns continue onto layer 3 and so on.

Answer 37

Involves watching the lips of someone. Hearing "ba" while seeing the lips move to say "ga" is often perceived as "da" by the listeners. Even when they know that the sound is ba, they often hear something else. So, they merge the auditory stimulus with the context provided by the lips.

Answer 38

Experimented with computer-generated syllables in which the delay between the release of air and the onset of voicing was varied from −150 ms to +150 ms. The participants had to identify which syllables began with 'b' and which with 'p'. At about 25 m/s there was a switch from 'b' to 'p' At 10 ms participants agreed that the sound was a 'b' and at 60 ms, they agreed that the sound was a 'p'. Because of this, perception of this feature was referred to as categorical.

Answer 39

Researched whether the fusiform gyrus is only specialized for face recognition. Gauthier et al. found that bird experts and car experts showed high activation in the fusiform gyrus when judging birds or cars, and another study showed that people who practiced recognizing unfamiliar objects called greebles also activated the fusiform gyrus. These studies showed that since we are very familiar with faces, we are good at making judgments about them, but similar detailed processing can happen with any type of object we have a lot of experience with.

Answer 40

Looked into which sounds were most often confused. Participants had to identify phonemes 'b', 'd', 'p', 't' by listening to the sounds 'ba', 'da', 'pa', 'ta' presented in noise. The result was that the participants often confused one sound in the noise for another, and they mostly confused consonants that differed by only one feature. E.g. when hearing 'p', the participants more often thought they heard 't', which only differs in place of articulation, rather than 'd', which differs in both place of articulation and voicing. Similarly, when hearing 'b', participants often thought they heard 'p', which differs only in voicing rather than 't', which differs in both features.

Answer 41

Goldstone (1994): Goldstone trained participants to categorize novel visual stimuli. where categories were determined by either either size or brightness of stimuli. People were better at discriminating to which category the stimuli belonged to when the aspect was relevant for discrimination. Goldstone & Hendrickson (2010) People are better at discriminating across categories (acquired distinctiveness) and decreased discriminabilty within categories (acquired equivalence).

Answer 42

There are clear boundaries for speech signals. Pisoni created nonlinguistic tones that had a distiguishing accoustic feature (comparable to voice-onset time in voicing, see p. 61 figure 2.24). It was either a low frequency tone, simultaneously presented with a high-fr. tone or it was lagged. Participant showed abrupt boundaries for speech signals. Thus for auditory categorical perception the signal doesn't need to be speech.

Answer 43

In his experiment to study how participants combine stimulus information from a letter with context information from the surrounding letters, Massaro showed 4 variations with different amounts of contextual evidence (see p.65, fig 2.27): 1: only "e" can make a word; 2: only "c" can make a word; 3: either "c" or "e" can make a word; 4: neither "e" or "c" can make a word. Massaro found when e looked less ambiguous and more like "e", prob. for identifying e increased, similiarly when context increased (see fig. 2.28). This model is called FLMP (fuzzy logical model of perception). Thus it can be concluded that contextual information combines independently with stimulus information to determine what pattern is perceived.

Answer 44

Kuhl trained chinchillas to discrimnate between syllabels "da" and "ta". Allthough these animals don't have a human vocal tract, they showed the same perceptual boundary humans do. Thus, categorical perception the signal doesn't depend on whether the perceiver has a human vocal or auditory system.

Perception Flashcards

(72 cards)