Speaker-Related Robustness Issues

Speaker dependent factors, such as gender, physical condition (cold or laryngitis), speaking style (emotion state, speech rate, etc.), cross-language, accent and session variations, are major concerns in speech signal processing. How they correlate with e

  • PDF / 1,461,687 Bytes
  • 57 Pages / 439.37 x 666.142 pts Page_size
  • 37 Downloads / 189 Views

DOWNLOAD

REPORT


Thomas Fang Zheng Lantian Li

RobustnessRelated Issues in Speaker Recognition 123

SpringerBriefs in Electrical and Computer Engineering Signal Processing

Series editors Woon-Seng Gan, Singapore, Singapore C.-C. Jay Kuo, Los Angeles, USA Thomas Fang Zheng, Beijing, China Mauro Barni, Siena, Italy

More information about this series at http://www.springer.com/series/11560

Thomas Fang Zheng Lantian Li •

Robustness-Related Issues in Speaker Recognition

123

Thomas Fang Zheng Tsinghua National Laboratory for Information Science and Technology, Division of Technical Innovation and Development, Department of Computer Science and Technology Center for Speech and Language Technologies, Research Institute of Information Technology, Tsinghua University Beijing China

Lantian Li Tsinghua National Laboratory for Information Science and Technology, Division of Technical Innovation and Development, Department of Computer Science and Technology Center for Speech and Language Technologies, Research Institute of Information Technology, Tsinghua University Beijing China

ISSN 2191-8112 ISSN 2191-8120 (electronic) SpringerBriefs in Electrical and Computer Engineering ISSN 2196-4076 ISSN 2196-4084 (electronic) SpringerBriefs in Signal Processing ISBN 978-981-10-3237-0 ISBN 978-981-10-3238-7 (eBook) DOI 10.1007/978-981-10-3238-7 Library of Congress Control Number: 2017937265 © The Author(s) 2017 This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, express or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. Printed on acid-free paper This Springer imprint is published by Springer Nature The registered company is Springer Nature Singapore Pte Ltd. The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721, Singapore

Preface

Speaker recognition (known as voiceprint recognition in industry) is the process of automatically identifying or verifying the identity of a person from his/her voice, using the characteri