Geospatial Semantic Web

  • PDF / 2,810,254 Bytes
  • 100 Pages / 547.087 x 737.008 pts Page_size
  • 35 Downloads / 201 Views

DOWNLOAD

REPORT


G/Technology

to estimate variances at unsampled locations, aiding in the design of targeted sampling strategies.

 Intergraph: Real Time Operational Geospatial

Applications

Gaussian  Hurricane Wind Fields, Multivariate Modeling

Gaussian Process Models in Spatial Data Mining NAREN R AMAKRISHNAN 1, C HRIS B AILEY-K ELLOGG 2 Department of Computer Science, Virginia Tech, Blacksburg, VA, USA 2 Department of Computer Science, Dartmouth College, Hanover, NH, USA

1

Synonyms Active data mining Definition Gaussian processes (GPs) are local approximation techniques that model spatial data by placing (and updating) priors on the covariance structures underlying the data. Originally developed for geo-spatial contexts, they are also applicable in general contexts that involve computing and modeling with multi-level spatial aggregates, e. g., modeling a configuration space for crystallographic design, casting folding energies as a function of a protein’s contact map, and formulation of vaccination policies taking into account social dynamics of individuals. Typically, we assume a parametrized covariance structure underlying the data to be modeled. We estimate the covariance parameters conditional on the locations for which we have observed data, and use the inferred structure to make predictions at new locations. GPs have a probabilistic basis that allow us

Historical Background The underlying ideas behind GPs can be traced back to the geostatistics technique called kriging [4], named after the South African miner Danie Krige. Kriging in this literature was used to model response variables (e. g., ozone concentrations) over 2D spatial fields as realizations of a stochastic process. Sacks et al. [12] described the use of kriging to model (deterministic) computer experiments. It took more than a decade from this point for the larger computer science community to investigate GPs for pattern analysis purposes. Thus, in the recent past, GPs have witnessed a revival primarily due to work in the statistical pattern recognition community [5] and graphical models literature [3]. Neal established the connection between Gaussian processes and neural networks with an infinite number of hidden units [8]. Such relationships allow us to take traditional learning techniques and re-express them as imposing a particular covariance structure on the joint distribution of inputs. For instance, we can take a trained neural network and mine the covariance structure implied by the weights (given mild assumptions such as a Gaussian prior over the weight space). Williams motivates the usefulness of such studies and describes common covariance functions [14]. Williams and Barber [15] describe how the Gaussian process framework can be extended to classification in which the modeled variable is categorical. Since these publications were introduced, interest in GPs has exploded with rapid publications in conferences such as ICML, NIPS; see also the recently published book by Rasmussen and Williams [11]. Scientific Fundamentals A GP can be formally d