Analysis of Rare Categories
In many real-world problems, rare categories (minority classes) play essential roles despite their extreme scarcity. The discovery, characterization and prediction of rare categories of rare examples may protect us from fraudulent or malicious behavior, a
- PDF / 1,661,195 Bytes
- 142 Pages / 439.37 x 666.14 pts Page_size
- 67 Downloads / 184 Views
Editorial Board: A. Bundy J. G. Carbonell M. Pinkal H. Uszkoreit M. Veloso W. Wahlster M. J. Wooldridge
For further volumes: http://www.springer.com/series/5216
Jingrui He
Analysis of Rare Categories
Dr. Jingrui He Machine Learning Group IBM T.J. Watson Research Center 1101 Kitchawan Road, Route 134 Yorktown Heights, NY 10598 USA Managing Editors Prof. Dov M. Gabbay Augustus De Morgan Professor of Logic Department of Computer Science King’s College London Strand, London WC2R 2LS, UK
Prof. Dr. Jörg Siekmann Forschungsbereich Deduktions- und Multiagentensysteme, DFKI Stuhlsatzenweg 3, Geb. 43 66123 Saarbrücken, Germany
Cognitive Technologies ISSN 1611-2482 ISBN 978-3-642-22812-4 e-ISBN 978-3-642-22813-1 DOI 10.1007/978-3-642-22813-1 Springer Heidelberg Dordrecht London New York Library of Congress Control Number: 2011941602 ACM Codes: I.2.6, E.1, H.2.8 © Springer-Verlag Berlin Heidelberg 2012 This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilm or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer. Violations are liable to prosecution under the German Copyright Law. The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. Printed on acid-free paper Springer is part of Springer Science+Business Media (www.springer.com)
Preface In many real world problems, rare categories (minority classes) play an essential role despite their extreme scarcity. For example, in financial fraud detection, the vast majority of financial transactions are legitimate, and only a small number may be fraudulent; in Medicare fraud detection, the percentage of bogus claims is small, but the total loss is significant; in network intrusion detection, malicious network activities are hidden among huge volumes of routine network traffic; in astronomy, only 0.001% of the objects in sky survey images are truly beyond the scope of current science and may lead to new discoveries; in spam image detection, near-duplicate spam images are difficult to discover from the large number of non-spam images; in rare disease diagnosis, rare diseases affect less than 1 out of 2000 people, but the consequences can be very severe. Therefore, the discovery, characterization and prediction of rare categories or rare examples may protect us from fraudulent or malicious behaviors, provide aid for scientific discoveries, and even save lives. This book focuses on the analysis of rare categories, where the majority classes have a smooth distribution, and the minori
Data Loading...