A Statistical Approach to Automatic Speech Summarization



Chiori Hori Department of Computer Science, Tokyo Institute of Technology, 2-12-1 O-okayama, Meguro-ku, Tokyo 152-8552, Japan Email: [email protected]

Sadaoki Furui Department of Computer Science, Tokyo Institute of Technology, 2-12-1 O-okayama, Meguro-ku, Tokyo 152-8552, Japan Email: [email protected]

Rob Malkin Interactive Systems Labs, Carnegie Mellon University, Pittsburgh, PA 15213, USA Email: [email protected]

Hua Yu Interactive Systems Labs, Carnegie Mellon University, Pittsburgh, PA 15213, USA Email: [email protected]

Alex Waibel Interactive Systems Labs, Carnegie Mellon University, Pittsburgh, PA 15213, USA Email: [email protected]

Received 20 March 2002 and in revised form 11 November 2002

This paper proposes a statistical approach to automatic speech summarization. In our method, a set of words maximizing a summarization score indicating the appropriateness of summarization is extracted from automatically transcribed speech and then concatenated to create a summary. The extraction process is performed using a dynamic programming (DP) technique based on a target compression ratio. In this paper, we demonstrate how an English news broadcast transcribed by a speech recognizer is automatically summarized. We adapted our method, which was originally proposed for Japanese, to English by modifying the model for estimating word concatenation probabilities based on a dependency structure in the original speech given by a stochastic dependency context free grammar (SDCFG). We also propose a method of summarizing multiple utterances using a two-level DP technique. The automatically summarized sentences are evaluated by summarization accuracy based on a comparison with a manual summary of speech that has been correctly transcribed by human subjects. Our experimental results indicate that the method we propose can effectively extract relatively important information and remove redundant and irrelevant information from English news broadcasts.

Keywords and phrases: speech summarization, summarization scores, two-level dynamic programming, stochastic dependency context free grammar, summarization accuracy.
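The word-extraction step described in the abstract can be pictured as a dynamic program over word positions: choose a fixed number of words from the transcription, in their original order, so that the total of per-word scores and word-concatenation scores is maximized. The following is a minimal sketch, not the authors' implementation; the functions word_score and concat_score are hypothetical placeholders standing in for the paper's per-word and SDCFG-based word-concatenation scores, and M would be derived from the target compression ratio.

```python
def summarize(words, M, word_score, concat_score):
    """Sketch of DP word extraction: pick M of the N transcribed words,
    preserving order, maximizing summed word and concatenation scores.
    `word_score` and `concat_score` are placeholder scoring functions,
    not the scores defined in the paper."""
    N = len(words)
    assert 1 <= M <= N
    NEG = float("-inf")
    # best[m][n]: best score of a summary of m words whose last word is words[n]
    best = [[NEG] * N for _ in range(M + 1)]
    back = [[None] * N for _ in range(M + 1)]
    for n in range(N):
        best[1][n] = word_score(words[n])
    for m in range(2, M + 1):
        for n in range(m - 1, N):
            for k in range(m - 2, n):
                if best[m - 1][k] == NEG:
                    continue
                s = (best[m - 1][k]
                     + concat_score(words[k], words[n])
                     + word_score(words[n]))
                if s > best[m][n]:
                    best[m][n] = s
                    back[m][n] = k
    # Backtrack from the best final word to recover the selected words.
    end = max(range(N), key=lambda n: best[M][n])
    summary, m, n = [], M, end
    while n is not None:
        summary.append(words[n])
        n, m = back[m][n], m - 1
    return list(reversed(summary))
```

Under these assumptions, the two-level DP for multiple utterances mentioned in the abstract would add an outer allocation of the word budget across utterances; that level is not sketched here.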

1. INTRODUCTION

The revolutionary increases in computing power and storage capacity have enabled an enormous amount of speech data, or multimedia data that includes speech, to be managed as an information source. The next step is to create a system in which speech data is tagged (annotated) with text, allowing information to be retrieved and extracted from such databases. Multimedia databases including indexes can be automatically constructed using speech-recognition systems. Speech can be broadcast with captions generated by speech-recognition systems and simultaneously saved in speech and text (i.e., caption) archives in a database. Captioning can be considered a form of indexing accessible by individual words in the whole speech. One approach attempted to extract information from such a database by tracking speech through
