Domain Adaptation for Visual Understanding

This unique volume reviews the latest advances in domain adaptation in the training of machine learning algorithms for visual understanding, offering valuable insights from an international selection of experts in the field. The text presents a diverse se

  • PDF / 15,038,578 Bytes
  • 148 Pages / 453.544 x 683.151 pts Page_size
  • 41 Downloads / 297 Views

DOWNLOAD

REPORT


daptation for Visual Understanding

Domain Adaptation for Visual Understanding

Richa Singh Mayank Vatsa Vishal M. Patel Nalini Ratha •



Editors

Domain Adaptation for Visual Understanding

123



Editors Richa Singh Indraprastha Institute of Information Technology Delhi New Delhi, India

Mayank Vatsa Indraprastha Institute of Information Technology Delhi New Delhi, India

Vishal M. Patel Johns Hopkins University Baltimore, MD, USA

Nalini Ratha IBM Thomas J. Watson Research Center Yorktown Heights, NY, USA

ISBN 978-3-030-30670-0 ISBN 978-3-030-30671-7 https://doi.org/10.1007/978-3-030-30671-7

(eBook)

© Springer Nature Switzerland AG 2020 This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. This Springer imprint is published by the registered company Springer Nature Switzerland AG The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland

Preface

In many real-world vision applications, there are very few or even no labeled samples, while an unrelated general domain is often available with a large number of labeled examples. For example, ImageNet contains millions of loosely labeled images over a large number of general classes of objects. On the other hand, a medical researcher may be interested in retrieving brain cancer fMRI scans closer to the patient’s brain scan image. Such data may not be available in large volumes or may be expensive to put forth the effort to annotate their collections by themselves. The problem of a lack of training samples can be challenging because of the significant statistical distribution difference between the feature distributions of training samples from the known available domain and the application domain. Researchers have often resorted to many techniques such as fine-tuning, hard mining, transfer learning, and domain adaptation to effectively use the large training samples from one domain and still get