Monday, February 9, 2009

Wavelet Applications

Computer and Human Vision


In the early 1980s, David Marr began work at MIT's Artificial Intelligence Laboratory on artificial vision for robots. He is an expert on the human visual system and his goal was to learn why the first attempts to construct a robot capable of understanding its surroundings were unsuccessful (2).

Marr believed that it was important to establish scientific foundations for vision, and that while doing so, one must limit the scope of investigation by excluding everything that depends on training, culture, and so on, and focus on the mechanical or involuntary aspects of vision. This low-level vision is the part that enables us to recreate the three-dimensional organization of the physical world around us from the excitations that stimulate the retina. Marr asked the questions:

  • How is it possible to define the contours of objects from the variations of their light intensity?
  • How is it possible to sense depth?
  • How is movement sensed?
He then developed working algorithmic solutions to answer each of these questions.

Marr's theory was that image processing in the human visual system has a complicated hierarchical structure that involves several layers of processing. At each processing level, the retinal system provides a visual representation that scales progressively in a geometrical manner. His arguments hinged on the detection of intensity changes. He theorized that intensity changes occur at different scales in an image, so that their optimal detection requires the use of operators of different sizes. He also theorized that sudden intensity changes produce a peak or trough in the first derivative of the image. These two hypotheses require that a vision filter have two characteristics: it should be a differential operator, and it should be capable of being tuned to act at any desired scale. Marr's operator was a wavelet that today is referred to as a "Marr wavelet."

No comments:

Post a Comment