Transforming 3D Coloured Pixels into Musical Instrument Notes for Vision Substitution Applications
- PDF / 4,438,176 Bytes
- 15 Pages / 600.03 x 792 pts Page_size
- 80 Downloads / 148 Views
Research Article Transforming 3D Coloured Pixels into Musical Instrument Notes for Vision Substitution Applications Guido Bologna,1 Benoˆıt Deville,2 Thierry Pun,2 and Michel Vinckenbosch1 1 University 2 Computer
of Applied Science, Rue de la prairie 4, 1202 Geneva, Switzerland Science Center, University of Geneva, Rue G´en´eral Dufour 24, 1211 Geneva, Switzerland
Received 15 January 2007; Accepted 23 May 2007 Recommended by Dimitrios Tzovaras The goal of the See ColOr project is to achieve a noninvasive mobility aid for blind users that will use the auditory pathway to represent in real-time frontal image scenes. We present and discuss here two image processing methods that were experimented in this work: image simplification by means of segmentation, and guiding the focus of attention through the computation of visual saliency. A mean shift segmentation technique gave the best results, but for real-time constraints we simply implemented an image quantification method based on the HSL colour system. More particularly, we have developed two prototypes which transform HSL coloured pixels into spatialised classical instrument sounds lasting for 300 ms. Hue is sonified by the timbre of a musical instrument, saturation is one of four possible notes, and luminosity is represented by bass when luminosity is rather dark and singing voice when it is relatively bright. The first prototype is devoted to static images on the computer screen, while the second has been built up on a stereoscopic camera which estimates depth by triangulation. In the audio encoding, distance to objects was quantified into four duration levels. Six participants with their eyes covered by a dark tissue were trained to associate colours with musical instruments and then asked to determine on several pictures, objects with specific shapes and colours. In order to simplify the protocol of experiments, we used a tactile tablet, which took the place of the camera. Overall, colour was helpful for the interpretation of image scenes. Moreover, preliminary results with the second prototype consisting in the recognition of coloured balloons were very encouraging. Image processing techniques such as saliency could accelerate in the future the interpretation of sonified image scenes. Copyright © 2007 Guido Bologna et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1.
INTRODUCTION
Echolocation is a mode of perception used spontaneously by many blind people. It consists in perceiving the environment by generating sounds and then listening to the corresponding echoes. Reverberations of various types of sound, such as slapping of the fingers, murmured words, whistles, noise of the steps, or sounds from a cane are commonly used. In this work we present See ColOr (Seeing Colours with an Orchestra), which is a multidisciplinary project at the cross-road of computer vision, audio processing and
Data Loading...