Computational Methods for Corpus Annotation and Analysis
In the past few decades the use of increasingly large text corpora has grown rapidly in language and linguistics research. This was enabled by remarkable strides in natural language processing (NLP) technology, technology that enables computers to automat
- PDF / 3,383,745 Bytes
- 192 Pages / 439.44 x 666.24 pts Page_size
- 6 Downloads / 210 Views
Xiaofei Lu
Computational Methods for Corpus Annotation and Analysis
1 3
Xiaofei Lu Department of Applied Linguistics The Pennsylvania State University University Park Pennsylvania USA
ISBN 978-94-017-8644-7 ISBN 978-94-017-8645-4 (eBook) DOI 10.1007/978-94-017-8645-4 Springer New York Heidelberg Dordrecht London Library of Congress Control Number: 2014931404 © Springer Science+Business Media Dordrecht 2014 This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. Exempted from this legal reservation are brief excerpts in connection with reviews or scholarly analysis or material supplied specifically for the purpose of being entered and executed on a computer system, for exclusive use by the purchaser of the work. Duplication of this publication or parts thereof is permitted only under the provisions of the Copyright Law of the Publisher’s location, in its current version, and permission for use must always be obtained from Springer. Permissions for use may be obtained through RightsLink at the Copyright Clearance Center. Violations are liable to prosecution under the respective Copyright Law. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. While the advice and information in this book are believed to be true and accurate at the date of publication, neither the authors nor the editors nor the publisher can accept any legal responsibility for any errors or omissions that may be made. The publisher makes no warranty, express or implied, with respect to the material contained herein. Printed on acid-free paper Springer is part of Springer Science+Business Media (www.springer.com)
Dedicated to my wife, Xiaomeng, and our daughter, Jasmine.
v
Preface
This book grew out of sets of lecture notes for a graduate course on computational methods for corpus annotation and analysis that I have taught in the Department of Applied Linguistics at The Pennsylvania State University since 2006. After several iterations of the course, my students and I realized that while there is an abundance of introductory sources on the fundamentals of corpus linguistics, most of them do not provide the types of detailed and systematic instructions that are necessary to help language and linguistics researches get off the ground with using computational tools other than concordancing programs for automatic corpus annotation and analysis. A large proportion of the students taking the course were not yet read
Data Loading...