Incorporating Dictionary Features into Conditional Random Fields for Gene/Protein Named Entity Recognition

Biomedical Named Entity Recognition (BioNER) is an important preliminary step for biomedical text mining. Previous researchers built dictionaries of gene/protein names from online databases and incorporated them into machine learning models as features, b

  • PDF / 15,606,257 Bytes
  • 688 Pages / 430.15 x 660.926 pts Page_size
  • 39 Downloads / 192 Views

DOWNLOAD

REPORT


Subseries of Lecture Notes in Computer Science

4819

Takashi Washio Zhi-Hua Zhou Joshua Zhexue Huang Xiaohua Hu Jinyan Li Chao Xie Jieyue He Deqing Zou Kuan-Ching Li Mário M. Freire (Eds.)

Emerging Technologies in Knowledge Discovery and Data Mining PAKDD 2007 International Workshops Nanjing, China, May 22, 2007 Revised Selected Papers

13

Series Editors Jaime G. Carbonell, Carnegie Mellon University, Pittsburgh, PA, USA Jörg Siekmann, University of Saarland, Saarbrücken, Germany Volume Editors Takashi Washio, Osaka University, Japan ([email protected]) Zhi-Hua Zhou, Nanjing University, China ([email protected]) Joshua Zhexue Huang, The University of Hong Kong, China ([email protected]) Xiaohua Hu, Drexel University, USA ([email protected]) Jinyan Li, Nanyang Technological University, Singapore ([email protected]) Chao Xie, Georgia State University, USA ([email protected]) Jieyue He, Southeast University, China ([email protected]) Deqing Zou, Huazhong University of Science and Technology, China ([email protected]) Kuan-Ching Li, Providence University, Taiwan ([email protected]) Mário M. Freire, University of Beira Interior, Portugal ([email protected]) Library of Congress Control Number: 2007941074 CR Subject Classification (1998): I.2, H.2.7-8, H.3, H.5.1, G.3, J.3 LNCS Sublibrary: SL 7 – Artificial Intelligence ISSN ISBN-10 ISBN-13

0302-9743 3-540-77016-X Springer Berlin Heidelberg New York 978-3-540-77016-9 Springer Berlin Heidelberg New York

This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer. Violations are liable to prosecution under the German Copyright Law. Springer is a part of Springer Science+Business Media springer.com © Springer-Verlag Berlin Heidelberg 2007 Printed in Germany Typesetting: Camera-ready by author, data conversion by Scientific Publishing Services, Chennai, India Printed on acid-free paper SPIN: 12197576 06/3180 543210

Preface

The techniques of knowledge discovery and data mining (KDD) have rapidly developed along the significant progress of the computer and its network technologies in the last two decades. The attention and the number of researchers in this domain continue to grow in both international academia and industry. The Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) is a worldwide representative international conference on the research areas of KDD. Under the current spread of KDD techniques in our society, PAKDD 2007 invited the organizers to an industrial track on KDD techniques and, moreover, called for enterprising proposals of workshops on novel and emerging topics in KDD