Abstract
The Earth Observing System Data and Information System (EOSDIS) is a comprehensive data and information system which archives, manages, and distributes Earth science data from the EOS spacecrafts. One non-existent capability in the EOSDIS is the retrieval of satellite sensor data based on weather events (such as tropical cyclones) similarity query output. In this paper, we propose a framework to solve the similarity search problem given user-defined instance-level constraints for tropical cyclone events, represented by arbitrary length multidimensional spatiotemporal data sequences. A critical component for such a problem is the similarity/metric function to compare the data sequences. We describe a novel Longest Common Subsequence (LCSS) parameter learning approach driven by nonlinear dimensionality reduction and distance metric learning. Intuitively, arbitrary length multidimensional data sequences are projected into a fixed dimensional manifold for LCSS parameter learning. Similarity search is achieved through consensus among the (similar) instance-level constraints based on ranking orders computed using the LCSS-based similarity measure. Experimental results using a combination of synthetic and real tropical cyclone event data sequences are presented to demonstrate the feasibility of our parameter learning approach and its robustness to variability in the instance constraints. We, then, use a similarity query example on real tropical cyclone event data sequences from 2000 to 2008 to discuss (i) a problem of scientific interest, and (ii) challenges and issues related to the weather event similarity search problem.
Original language | English (US) |
---|---|
Title of host publication | KDD'10 - Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data |
Pages | 135-144 |
Number of pages | 10 |
DOIs | |
State | Published - Sep 7 2010 |
Externally published | Yes |
Event | 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD-2010 - Washington, DC, United States Duration: Jul 25 2010 → Jul 28 2010 |
Other
Other | 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD-2010 |
---|---|
Country/Territory | United States |
City | Washington, DC |
Period | 7/25/10 → 7/28/10 |
All Science Journal Classification (ASJC) codes
- Software
- Information Systems