An Overview of Phonetic Encoding Algorithms

PDF / 230,294 Bytes
15 Pages / 612 x 792 pts (letter) Page_size
90 Downloads / 215 Views

E SCALE SYSTEMS CONTROL

An Overview of Phonetic Encoding Algorithms V. S. Vykhovanets∗,a , J. Du∗∗,b , and S. A. Sakulin∗∗,c ∗

Trapeznikov Institute of Control Sciences, Russian Academy of Sciences, Moscow, Russia ∗∗ Bauman Moscow State Technical University, Moscow, Russia e-mail: a [email protected], b [email protected], c [email protected] Received September 12, 2017 Revised September 12, 2017 Accepted May 25, 2020

Abstract—This paper presents an overview of the phonetic encoding algorithms designed to determine the similarity of words in sound (pronunciation). Phonetic encoding algorithms are divided into the algorithms for comparing words and the algorithms for determining the distance between words. Word comparison algorithms, such as SoundEx, NYSIIS, Daitch–Mokotoﬀ, Metaphone, and Polyphone, as well as algorithms for determining the distance between words, such as Levenshtein, Jaro, and N -grams, are described. For each algorithm, the advantages and shortcomings are discussed, and an analog for the Russian language is given. For eliminating the common shortcomings of phonetic encoding algorithms, the idea suggested in this paper is to use not the letter sequences of words, but the sequences of their elementary sounds. In this case, word recognition, record linkage, and word indexing by sounds are expected to improve. Keywords: phonetic encoding algorithm, phonetic distance, record linkage, word indexing by sound DOI: 10.1134/S0005117920100082

1. INTRODUCTION Phonetic encoding algorithms are algorithms for word indexing by sound, which use a letter sequence of a word and pronunciation rules to convert them into text (code, index, or key). If the encoding texts for two diﬀerent words coincide or are close to one another, the resulting conclusion is that these words are similar in sound. The ﬁrst phonetic encoding algorithm for the English language, SoundEx [69], was used in the 1930s to encode surnames in a population census. The algorithm is based on an encoding method designed to eliminate spelling and typographical errors in names. For example, SoundEx will calculate the same encoding text “S530” for such words as “Smith,” “Smithe,” and “Smyth,” i.e., will identify these words as identical in sound. With the development of computer technologies, many other phonetic encoding algorithms appeared, including the ones for other natural languages. Examples of such algorithms are NYSIIS [77], the advanced SoundEx [75], Metaphone [40], and others. Phonetic encoding algorithms include not only algorithms for comparing words, but also algorithms for determining the distance between words in the case of searching by sound. The algorithms for calculating the Levenshtein distance [2] and the Jaro distance [30], as well as the distance based on N -grams [33] are most widespread in applications. Phonetic encoding algorithms are widely used in the ﬁelds where the comparison of acoustic data with text samples is required. Some example include speech recognition, word spelling check and correction, database search, extrac

Data Loading...

An Overview of Phonetic Encoding Algorithms

Recommend Documents

Phonetic Awareness, Phonetic Sensitivity and the Second Language Learner

Phonetic Search

Evaluating Phonetic Search KWS

Algorithms for Efficient and Fast 3D-HEVC Depth Map Encoding

An Overview of Psychodrama

An Overview

An Overview of Glimpse

An Overview

An Overview of Meningiomas

An Overview of Blockchain Consensus Algorithms: Comparison, Challenges and Future Directions

An Overview of the Tools

An Overview of Artificial Intelligence