Skip to main content
Thesis defences

PhD Oral Exam - Mahsa Orang, Computer Science

Similarity Search and Analysis Techniques for Uncertain Time Series Data


Date & time
Monday, July 27, 2015
10 a.m. – 1 p.m.
Cost

This event is free

Organization

School of Graduate Studies

Contact

Sharon Carey
514-848-2424 ext. 3802

Wheel chair accessible

Yes

When studying for a doctoral degree (PhD), candidates submit a thesis that provides a critical review of the current state of knowledge of the thesis subject as well as the student’s own contributions to the subject. The distinguishing criterion of doctoral graduate research is a significant and original contribution to knowledge.

Once accepted, the candidate presents the thesis orally. This oral exam is open to the public.

Abstract

Emerging applications, such as wireless sensor networks and location-based services, require the ability to analyze large quantities of uncertain time series, where the exact value at each timestamp is unavailable or unknown. Traditional similarity search techniques used for standard time series are not always effective for uncertain time series data analysis. This motivates our work in this dissertation. We investigate new, efficient solution techniques for similarity search and analysis of both uncertain time series models, i.e., PDF-based uncertain time series (having probability density function) and multiset-based uncertain time series (having multiset of observed values) in general, as well as correlation queries in particular. In our research, we first formalize the notion of normalization. This notion is used to introduce the idea of correlation for uncertain time series data. We consider a class of probabilistic, threshold-based correlation queries over such data. We model uncertain correlation as a random variable that is a basis to develop techniques for similarity search and analysis of uncertain time series. Moreover, we propose a few query optimization and query quality improvement techniques. Finally, we demonstrate experimentally how the proposed techniques can improve similarity search in uncertain time series. We believe that our results provide a theoretical baseline for uncertain time series management and analysis tools that will be required to support many existing and emerging applications.

Back to top

© Concordia University