Robust and Adaptive OMR System Including Fuzzy Modeling, Fusion of Musical Rules, and Possible Error Detection
- PDF / 3,231,739 Bytes
- 25 Pages / 600.03 x 792 pts Page_size
- 52 Downloads / 179 Views
Research Article Robust and Adaptive OMR System Including Fuzzy Modeling, Fusion of Musical Rules, and Possible Error Detection Florence Rossant1 and Isabelle Bloch2 1 Telecom, 2 Signal
Signal, and Image Department, Institut Sup´erieur d’Electronique de Paris (ISEP), 21 Rue d’Assas, 75006 Paris, France and Image Processing Department, ENST, CNRS UMR 5141, 46 Rue Barrault, 75634 Paris Cedex 13, France
Received 1 December 2005; Revised 28 August 2006; Accepted 28 August 2006 Recommended by Ichiro Fujinaga This paper describes a system for optical music recognition (OMR) in case of monophonic typeset scores. After clarifying the difficulties specific to this domain, we propose appropriate solutions at both image analysis level and high-level interpretation. Thus, a recognition and segmentation method is designed, that allows dealing with common printing defects and numerous symbol interconnections. Then, musical rules are modeled and integrated, in order to make a consistent decision. This high-level interpretation step relies on the fuzzy sets and possibility framework, since it allows dealing with symbol variability, flexibility, and imprecision of music rules, and merging all these heterogeneous pieces of information. Other innovative features are the indication of potential errors and the possibility of applying learning procedures, in order to gain in robustness. Experiments conducted on a large data base show that the proposed method constitutes an interesting contribution to OMR. Copyright © 2007 F. Rossant and I. Bloch. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1.
INTRODUCTION
This paper proposes improvements and extensions to our earlier work on optical music recognition (OMR) [1]. OMR aims at automatically reading scanned scores in order to convert them into an electronic format, such as an MIDI file, or an audio waveform. This conversion requires a symbolic representation of the score content, achieved through recognition of its individual components and their structure. The motivation for OMR is manifold, and possible applications cover several topics addressed in this special issue, including automatic transcription, editing, transposition and arrangement, semantic analysis, fingerprinting (which is facilitated by the symbolic representation), feature extraction, indexing and mining, which are important components of query systems, and can all benefit from symbolic representations. The literature acknowledges active research in the 1970’s and 1980’s, see, for example, the reviews in [2, 3], until the first commercial products in the early 1990’s. The success of these works relies heavily on available knowledge (as opposed to other document analysis problems): reasonable number of symbols, strict location of the staff lines, strong rules of music writing. But still, the problem remains difficult and
solutions are generally computationa
Data Loading...