Recognition by Components Theory
The recognition-by-components theory, or RBC theory, is a bottom-up process proposed by Irving Biederman to explain object recognition. According to RBC theory, we are able to recognize objects by separating them into geons (the object’s main component parts). Biederman suggested that geons are based on basic 3-dimensional shapes (cylinders, cones, etc.) that can be assembled in various arrangements to form a virtually unlimited number of objects.
Geons
The recognition-by-components theory suggests that there are fewer than 36 geons, which are combined to create the objects we see in day-to-day life. For example, when looking at a mug we break it down into two components – “cylinder” and “handle”. This also works for more complex objects, which are made up of a larger number of geons. Perceived geons are then compared with objects in our stored memory to identify what we are looking at.
The theory proposes that when we view objects we look for two important components:
- Edges – These enable us to maintain the same perception of the object regardless of viewing orientation.
- Concavities – The areas where two edges meet. These enable us to observe the separation between two or more geons.
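The comparison between perceived geons and stored memory can be illustrated with a toy lookup. This is only a minimal sketch, not Biederman's model: the `MEMORY` table, the relation labels, the function names, and the bucket and suitcase entries are hypothetical additions for illustration. It shows that a set of geons alone may be ambiguous, while geons together with the relations between them can pick out a single object.

```python
# Hypothetical stored memory: each object is described by its geons and by
# the relations between them. All labels are illustrative assumptions.
MEMORY = {
    "mug": {
        "geons": {"cylinder", "handle"},
        "relations": {("handle", "attached to side of", "cylinder")},
    },
    "bucket": {
        "geons": {"cylinder", "handle"},
        "relations": {("handle", "attached to top of", "cylinder")},
    },
    "suitcase": {
        "geons": {"brick", "handle"},
        "relations": {("handle", "attached to top of", "brick")},
    },
}


def candidates_by_geons(geons, memory=MEMORY):
    """Names of stored objects built from exactly this set of geons."""
    return sorted(
        name for name, desc in memory.items() if desc["geons"] == set(geons)
    )


def recognize(geons, relations, memory=MEMORY):
    """Names of stored objects whose geons AND relations both match."""
    return sorted(
        name
        for name, desc in memory.items()
        if desc["geons"] == set(geons) and desc["relations"] == set(relations)
    )


if __name__ == "__main__":
    seen_geons = {"cylinder", "handle"}
    seen_relations = {("handle", "attached to side of", "cylinder")}
    print(candidates_by_geons(seen_geons))        # geons alone
    print(recognize(seen_geons, seen_relations))  # geons plus relations
```

In this sketch an exact set comparison stands in for whatever matching process the visual system uses; a more realistic model would need to tolerate partial or noisy descriptions.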
Viewpoint Invariance
One of the defining features of the recognition-by-components theory is that it enables us to recognize objects regardless of viewing angle; this is known as viewpoint invariance. It is proposed that the reason for this effect is the invariant edge properties of geons.
The invariant edge properties are as follows:
- Curvature (various points of a curve)
- Parallel lines (two or more lines which follow the same direction)
- Co-termination (the point at which two or more edges meet and therefore cease to continue)
- Symmetry and asymmetry
- Co-linearity (points lying along a common straight line)
Our knowledge of these properties means that when viewing an object or geon, we can perceive it from almost any angle. For example, when viewing a brick we can see sets of horizontal parallel lines and sets of vertical ones, and by considering where these lines meet (co-termination) we are able to perceive the object.
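The brick example can be checked numerically for one of these properties, parallelism. The sketch below is an illustration under stated assumptions, not part of the theory itself: it assumes an orthographic projection (simply dropping the depth coordinate), under which parallel edges stay exactly parallel in the image; under perspective projection this holds only approximately. The edge coordinates, the list of viewing angles, and all function names are hypothetical.

```python
import math


def rotate(point, yaw, pitch):
    """Rotate a 3D point about the vertical (y) axis, then about the x axis."""
    x, y, z = point
    x, z = (x * math.cos(yaw) + z * math.sin(yaw),
            -x * math.sin(yaw) + z * math.cos(yaw))
    y, z = (y * math.cos(pitch) - z * math.sin(pitch),
            y * math.sin(pitch) + z * math.cos(pitch))
    return (x, y, z)


def image_direction(edge, yaw, pitch):
    """2D direction of a 3D edge after rotation and orthographic projection."""
    (ax, ay, _), (bx, by, _) = (rotate(p, yaw, pitch) for p in edge)
    return (bx - ax, by - ay)


def parallel_in_image(edge_a, edge_b, yaw, pitch, tol=1e-9):
    """True if the two projected edges have a (near-)zero 2D cross product."""
    (ux, uy), (vx, vy) = (
        image_direction(e, yaw, pitch) for e in (edge_a, edge_b)
    )
    return abs(ux * vy - uy * vx) <= tol


# Two long edges of a brick (parallel in 3D) and one upright edge.
LONG_EDGE_1 = ((0, 0, 0), (2, 0, 0))
LONG_EDGE_2 = ((0, 1, 1), (2, 1, 1))
UPRIGHT_EDGE = ((0, 0, 0), (0, 1, 0))

# Arbitrary (yaw, pitch) viewing angles in radians.
VIEWS = [(0.3, 0.5), (1.1, -0.4), (2.0, 0.9)]


def parallel_in_every_view(edge_a, edge_b, views=VIEWS):
    """True if the two edges look parallel from every listed viewpoint."""
    return all(
        parallel_in_image(edge_a, edge_b, yaw, pitch) for yaw, pitch in views
    )


if __name__ == "__main__":
    print(parallel_in_every_view(LONG_EDGE_1, LONG_EDGE_2))
    print(parallel_in_every_view(LONG_EDGE_1, UPRIGHT_EDGE))
```

The two long edges remain parallel in every listed view, while the long and upright edges do not. In the rare view where an edge points straight at the observer it projects to a single point and the check passes trivially, which is consistent with the "almost any angle" qualification above.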
Strengths of the theory
Utilizing geons as structural primitives results in two key advantages. First, because geons are based on object properties that are stable across viewpoint ('viewpoint invariant'), and all geons are discriminable from one another, a single geon description is sufficient to describe an object from all possible viewpoints. Second, considerable economy of representation is achieved: a relatively small set of geons forms a simple "alphabet" whose members can be combined to form complex objects. For example, with only 24 geons, there are 306 billion possible combinations of 3 geons, allowing for all possible objects to be recognised.
Experimental evidence
- Participants show a remarkable ability to recognise objects despite visual noise, provided that the geons are visible.
- Removal of feature-relation information (relations between geons) impairs object recognition.
- No visual priming occurs if different geons are used between trials.
Weaknesses
RBC theory is not in itself capable of starting with a photograph of a real object and producing a geons-and-relations description of it; the theory does not attempt to provide a mechanism for reducing the complexities of real scenes to simple geon shapes. RBC theory is also incomplete in that geons and the relations between them fail to distinguish many real objects. For example, a pear and an apple are easily distinguished by humans but lack the corners and edges that RBC theory needs to tell them apart. However, Irving Biederman has argued that RBC theory is the "preferred" mode of human object recognition, with a secondary process handling objects that are not distinguishable by their geons. He further states that this distinction explains research suggesting that objects may or may not be recognized equally well with changes in viewpoint².