A novel active multi-source transfer learning algorithm for time series forecasting

  • PDF / 3,727,221 Bytes
  • 25 Pages / 595.276 x 790.866 pts Page_size
  • 59 Downloads / 122 Views

DOWNLOAD

REPORT


A novel active multi-source transfer learning algorithm for time series forecasting Qitao Gu 1 & Qun Dai 1

# Springer Science+Business Media, LLC, part of Springer Nature 2020

Abstract In Time Series Forecasting (TSF), researchers usually assume that there is enough training data can be obtained, with the old a`nd new data satisfying the same distribution. However, time series data always produces some time-varying characteristics over time, which will lead to relatively large differences between old and new data. As we all know, single-source TSF Transfer Learning (TL) faces the problem of negative transfer. Addressing this issue, this paper proposes a new Multi-Source TL algorithm, abbreviated as the MultiSrcTL algorithm, and a novel Active Multi-Source Transfer Learning, abbreviated as the AcMultiSrcTL algorithm, with the latter one integrating Multi-Source TL with Active Learning (AL), and taking the former one as its sub-algorithm. We introduce domain adaptation theory into this work, and analyze the expected target risk of TSF under the multi-source setting, accordingly. For the development of MultiSrcTL, we make full use of source similarity and domain dependability, using the Maximum Mean Discrepancy statistical indicator to measure the similarity between domains, so as to promote better transfer. A domain relation matrix is constructed to describe the relationship between source domains, so that the source-source and source-target relations are adequately considered. In the design of AcMultiSrcTL, Kullback-Leibler divergence is used to measure the similarity of related indicators to select the appropriate source domain. The uncertainty sampling method and the distribution match weighting technique are integrated, obtaining a new sample selection scheme. The empirical results on six benchmark datasets demonstrate the applicability and effectiveness of the two proposed algorithms for multi-source TSF TL. Keywords Time series forecasting (TSF) . Transfer learning (TL) . Multi-source transfer learning (MSTL) . Active multi-source transfer learning (AMSTL)

1 Introduction Time series refers to the series of numerical values of the same statistical indicator arranged in chronological order, while time series analysis and prediction are important methods of dynamic data analysis and processing. Since people want to obtain future data based on historical data, Time Series Forecasting (TSF) is widely applied in engineering, finance, natural sciences, and so on. During the decades from the presentation of time series to the present, with the development of mathematics and machine learning theories, many TSF algorithms with good performance have been proposed, such

* Qun Dai [email protected] 1

College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China

as Exponential Smoothing (ES) [1], Autoregressive Moving Average (ARMA) model [2], Autoregressive Integrated Moving Average (ARIMA) model [3], Neural Networks (NNs) [4], Support Vector Machine (SVM) [5],