Knowledge Discovery for Business Information Systems

Current database technology and computer hardware allow us to gather, store, access, and manipulate massive volumes of raw data in an efficient and inexpensive manner. In addition, the amount of data collected and warehoused in all industries is growing e

  • PDF / 31,366,821 Bytes
  • 442 Pages / 468 x 756 pts Page_size
  • 80 Downloads / 325 Views

DOWNLOAD

REPORT


The Kluwer International Series in Engineering and Computer Science

Knowledge Discovery for Business Information Systems Edited by

Witold Abramowicz The University of Economics, Poland Jozef Zurada University of Louisville, U.S.A.

KLUWER ACADEMIC PUBLISHERS NEW YORK, BOSTON, DORDRECHT, LONDON, MOSCOW

eBook ISBN: Print ISBN:

0-306--46991-X 0-7923-7243-3

©2002 Kluwer Academic Publishers New York, Boston, Dordrecht, London, Moscow

All rights reserved

No part of this eBook may be reproduced or transmitted in any form or by any means, electronic, mechanical, recording, or otherwise, without written consent from the Publisher

Created in the United States of America

Visit Kluwer Online at: and Kluwer's eBookstore at:

http://www.kluweronline.com http://www.ebooks.kluweronline.com

Contents

PREFACE

xi

FOREWORD LIST OF CONTRIBUTORS Chapter 1

Chapter 2

INFORMATION FILTERS SUPPLYING DATA WAREHOUSES WITH BENCHMARKING INFORMATION Witold Abramowicz, 1. Introduction 2. Data Warehouses 3. The HyperSDI System 4. User Profiles in the HyperSDI System 5. Building Data Warehouse Profiles 6. Techniques for Improving Profiles 7. Implementation Notes 8. Conclusions References PARALLEL MINING OF ASSOCIATION RULES David Cheung, Sau Dan Lee 1. Introduction 2. Parallel Mining of Association Rules 3. Pruning Techniques and The FPM Algorithm 4. Metrics for Data Skewness and Workload Balance 5. Partitioning of the Database 6. Experimental Evaluation of the Partitioning Algorithms 7. Discussions 8. Conclusions References

xiii xv

1 1 2 4 11 11 18 22 25

29

29

32 33 39 48 56 62 64 65

CONTENTS

vi Chapter 3

UNSUPERVISED FEATURE RANKING AND SELECTION 67 Manoranjan Dash, Huan Liu, Jun Yao 1. Introduction 67 2. Basic Concepts and Possible Approaches 69 3. An Entropy Measure for Continuous and Nominal Data Types 72 4. Algorithm to Find Important Variables 75 5. Experimental Studies 76 6. Clustering Using SUD 80 7. Discussion and Conclusion 82 84 References

Chapter 4

APPROACHES TO CONCEPT BASED EXPLORATION OF INFORMATION RESOURCES Hele-Mai Haav, Jørgen Fischer Nilsson 1. Introduction 2. Conceptual Taxonomies 3. Ontology Driven Concept Retrieval 4. Search based on formal concept analysis 5. Conclusion Acknowledgements References

89 91 99 104 109 109 109

HYBRID METHODOLOGY OF KNOWLEDGE DISCOVERY FOR BUSINESS INFORMATION

111

Chapter 5

5. Hippe

1. Introduction 2. Present Status of Data Mining 3. Experiments with Mining Regularities from Data 4. Discussion Acknowledgements References Chapter 6

FUZZY LINGUISTIC SUMMARIES OF DATABASES FOR AN EFFICIENT BUSINESS DATA ANALYSIS AND DECISION SUPPORT Janusz Kacprzyk, Ronald R. Yager and 1. Introduction 2. Idea of Linguistic Summaries Using Fuzzy Logic with Linguistic Quantifiers 3. On Other Validity Criteria 4. Derivation of Linguistic Summaries via a Fuzzy Logic Based Database Querying Interface 5. Implementation for a Sales Database at a Computer Retailer

89

111 113 118

125 126

126

129 129 131 134 140 147

CONTENTS

vii 6. Concluding Remarks References

Chapter 7

Chapter 8