What is the problem and goal with many-to-one mapping?
where does object recognition take place?
models of object recognition: The problem and three levels of analysis
The Problem:
- All models share a common assumption
- The senses register the presence of a stimulus. An internal representation of the stimulus is generated (perceptual representation). The object is recognised when there is a match between the perceptual representation and some stored representation of the same object. Object recognition requires the interaction of perception and memory
- Several models have attempted to explain how the visual system constructs a perception of a recognised object
- David Marr states object recognised includes a computational approach and has three levels of analysis
1. Computational – what is the system doing and why?
2. Algorithmic – which processes, rules, and algorithms are used to solve the problem?
3. Implementational – how are these processes implemented by the system
What are template-matching models?
what are feature detection models?
What are some controversies in object recognition?
Structural description models: Marr & Nishihara (1978)
Structural description models: Biedermann (1987)
Assessment of structural description models
Pros:
- Invariance is well explained
- Recognition relies on description rather than matching
- Graded representations cope with discrimination and generalization
- Evidence that structural information matters to humans and to neurons
Cons:
- Extracting model parameters can be hard in real images (e.g. occlusion)
- Structural description is difficult for some objects (e.g. crumpled paper)
- Driven by theoretical desirability rather than behavioural or physiological evidence
what are view-dependent models?
Evidence for view-dependent models
Assessment of view-dependent models
Pros:
- Straightforward
- Minimises transformations that must be performed
- Newer models are based directly on what we know of physiology
- Abstract features are recombinable
- Good behavioural, physiological, and stimulation-based evidence
Cons:
- Humans often show quite good generalisation across viewpoints even for novel objects
- Still more memory intensive than e.g. geon model