Do Demonstration 2.1. Define “perception,” and provide an original example of perception. Define “object recognition,” “distal stimulus,” and “proximal stimulus.” What is sensory memory and the primary visual cortex?
Perception is the process of organizing and interpreting sensory information to give it meaning; for example, recognizing a friend’s face in a crowd. Object recognition is identifying objects by matching sensory input with stored knowledge. A distal stimulus is the actual object in the environment, while a proximal stimulus is the image of that object on sensory receptors (e.g., retina).
Sensory memory is a very brief storage system that preserves sensory impressions for milliseconds to a few seconds. The primary visual cortex is the first cortical area in the occipital lobe that processes visual input, analyzing basic features such as edges, orientation, and motion.
Describe the ambiguous figure-ground illusion and the illusory contour effect. What do these illusions indicate about visual perception?
The ambiguous figure-ground illusion occurs when a visual stimulus can be perceived as either the foreground or background, such as a vase that also appears as two faces. The illusory contour effect occurs when people perceive edges and shapes that are not physically present, such as seeing a triangle formed by gaps in circles.
These illusions show that perception is constructive and guided by the brain’s interpretations rather than being a direct copy of sensory input. The visual system actively organizes stimuli using prior knowledge and expectations.
Describe the following theories of object recognition, and explain the advantages and disadvantages of each:
template matching theory
feature-analysis models
recognition-by-components model
Comment: Recognition-by-components and feature-analysis are closely related models with similar operations. The major difference lies in the unit of recognition (e.g., feature versus geons). These two models can also be described using the parallel distributed processing approach. Features (or geons) could exist as highly interconnected networks. The objects being recognized could be represented as the connection weights between the activated features (or geons).
Template matching theory proposes that objects are recognized by comparing stimuli to stored templates; it is simple but inefficient because it requires many templates and fails with viewpoint changes. Feature-analysis models propose recognition through identification of basic features (lines, curves); they are flexible and efficient but struggle to explain complex object recognition alone.
Recognition-by-components model proposes that objects are recognized by combining simple 3D shapes called geons; it explains viewpoint invariance and complex recognition but may oversimplify curved or irregular objects. Comment: Recognition-by-components and feature-analysis are closely related models with similar operations. The major difference lies in the unit of recognition (e.g., feature versus geons). These two models can also be described using the parallel distributed processing approach. Features (or geons) could exist as highly interconnected networks. The objects being recognized could be represented as the connection weights between the activated features (or geons).
Do Demonstration 2.2. Consider the following letter pairs: E and F, K and M, Z and B, and N and M. Which pair of letters shares the most features, and which pair shares the least? Which pair would require the most time for someone to determine whether they are different? Which pair would require the least amount of time to arrive at this determination?
E and F share the most features because they differ by only one line segment, while Z and B share the least features due to very different shapes. Pairs with more shared features require more careful comparison.
Therefore, E and F would take the most time to judge as different, while Z and B would take the least time because their differences are more visually distinct.
Define and describe “bottom-up processing” and “top-down processing.” Give original examples of each type of perceptual processing.
Comment: Top-down processing and bottom-up processing are one of Matlin and Farmer’s five themes in the text. In the context of perception, these processes are used to describe how knowledge (input for top-down processes) and the external environment (input for bottom-up processes) are combined to recognize objects as old or new. The concepts of top-down and bottom-up processing are used in other cognitive domains, such as language comprehension. Note that, regardless of the domain, there are few instances in which top-down or bottom-up processing operate in isolation. However, in familiar contexts, top-down processing is more likely to dominate processing as we rely more on our knowledge and expectations, whereas in novel situations we are more likely to use bottom-up strategies. When you generate your examples of these processes, be sure that you can explain why top-down or bottom-up processing is the dominant process.
Bottom-up processing is perception driven by sensory input, building from basic features to complex understanding; for example, identifying a new symbol by analyzing its lines and shapes. Top-down processing is perception guided by prior knowledge, expectations, and context; for example, reading messy handwriting by using sentence context to guess unclear words.
Comment: Top-down processing and bottom-up processing are one of Matlin and Farmer’s five themes in the text. In the context of perception, these processes are used to describe how knowledge (input for top-down processes) and the external environment (input for bottom-up processes) are combined to recognize objects as old or new. The concepts of top-down and bottom-up processing are used in other cognitive domains, such as language comprehension. Note that, regardless of the domain, there are few instances in which top-down or bottom-up processing operate in isolation. However, in familiar contexts, top-down processing is more likely to dominate processing as we rely more on our knowledge and expectations, whereas in novel situations we are more likely to use bottom-up strategies. When you generate your examples of these processes, be sure that you can explain why top-down or bottom-up processing is the dominant process.
Do Demonstration 2.3. Briefly explain how top-down processing affects object recognition using this demonstration. What is “word superiority effect”? How is word superiority effect related to top-down processing?
Top-down processing influences object recognition by allowing context and prior knowledge to shape how ambiguous stimuli are interpreted. In demonstrations, letters are recognized more easily when embedded in meaningful words because expectations guide perception.
The word superiority effect is the finding that letters are identified more accurately and quickly when they appear in real words than in isolation or nonwords. This occurs because top-down processing uses word-level knowledge to facilitate recognition of individual letters.
Describe the phenomenon known as “change blindness.” Do Demonstration 2.4. Define “inattentional blindness,” and indicate how it differs from change blindness. What do these effects indicate about the role of top-down processes in visual object recognition?
Change blindness is the failure to notice significant visual changes when attention is diverted or when changes occur gradually. Inattentional blindness is the failure to notice an unexpected object because attention is focused elsewhere, even when the object is clearly visible.
Change blindness involves missing alterations over time, whereas inattentional blindness involves missing unexpected stimuli entirely. These effects show that attention and expectations strongly influence perception, demonstrating the importance of top-down processes in determining what we consciously perceive.
How is face perception different from normal object recognition? Define “prosopagnosia.” What does the neuroscience research on face recognition indicate? Summarize the applied research on face recognition. What do these findings reveal about our object recognition skills?
Face perception relies on specialized, holistic processing that emphasizes spatial relationships among features rather than independent parts. Prosopagnosia is a neurological disorder causing severe difficulty recognizing faces despite normal vision and intelligence.
Neuroscience research shows specialized brain regions such as the fusiform face area are highly active during face recognition. Applied research shows people recognize familiar faces well but are poor at identifying unfamiliar faces or matching photos, revealing that object recognition is highly experience-dependent and less reliable than we assume.
Define “phonemes.” Describe the four characteristics of speech perception, including definitions for inter-speaker variability, coarticulation, phonemic restoration, and the McGurk effect. Do Demonstrations 2.5 and 2.6. What do these characteristics reveal about the complexity of speech perception?
Phonemes are the smallest units of sound that distinguish meaning in a language. Inter-speaker variability refers to pronunciation differences across speakers. Coarticulation occurs when phonemes overlap in speech production. Phonemic restoration is perceiving missing sounds as present when masked by noise. The McGurk effect occurs when conflicting visual mouth movements alter what sound is heard.
These characteristics show speech perception is adaptive and constructive, integrating auditory input with visual cues, context, and prior knowledge. Speech understanding depends on complex neural processing rather than simple sound detection.
Describe the two theories of speech perception. Be sure that your answer includes a discussion of the phonetic module and categorical perception.
The motor theory proposes that speech perception depends on a specialized phonetic module that decodes sounds by referencing the articulatory gestures used to produce them. This module enables categorical perception, where continuous sounds are perceived as discrete phoneme categories.
The general auditory theory argues that speech perception relies on general-purpose auditory mechanisms rather than a specialized module. It also explains categorical perception as a product of learned auditory distinctions shaped by language experience.