iSAX 2.0: Indexing and Mining One Billion Time Series; Database Cracking

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=cE7FWD15_oA



Duration: 1:25:35
732 views
9


iSAX 2.0: Indexing and Mining One Billion Time Series abstract -------- There is an increasingly pressing need, by several applications in diverse domains, for developing techniques able to index and mine very large collections of time series. Examples of such applications come from astronomy, biology, the web, and other domains. It is not unusual for these applications to involve numbers of time series in the order of hundreds of millions to billions. In this paper, we describe iSAX 2.0, a data structure designed for indexing and mining truly massive collections of time series. We show that the main bottleneck in mining such massive datasets is the time taken to build the index, and we thus introduce a novel bulk loading mechanism, the first of this kind specifically tailored to a time series index. We show how our method allows mining on datasets that would otherwise be completely untenable, including the first published experiments to index one billion time series, and experiments in mining massive data from domains as diverse as entomology, DNA and web-scale image collections. Database Cracking and the Path Towards Auto-tuning Database Kernels ABSTRACT: Database cracking targets dynamic and exploratory environments where there is no sufficient workload knowledge and idle time to invest in physical design preparations and tuning. With DB cracking indexes are built incrementally, adaptively and on demand; each query is seen as an advice on how data should be stored. With each incoming query, data is reorganized on-the-fly as part of the query operators, while future queries exploit and continuously enhance this knowledge. Autonomously, adaptively and without any external human administration, the system quickly adapts to a new workload and reaches optimal performance when the workload stabilizes. We will talk about the basics of DB cracking including selection cracking, partial and sideways cracking and updates. We will also talk about important open and on going research issues such as disk based cracking, concurrency control and integration of cracking with offline and online index analysis.




Other Videos By Microsoft Research


2016-08-16Spanish forest dynamics: Constraining models with data
2016-08-16Real versus Realistically Rendered Ground Truth
2016-08-16How to solve a 112-bit ECDLP using game consoles
2016-08-16A First Look at a Commercial Hybrid Content Delivery System
2016-08-16Understanding Roles in Software Organizations
2016-08-16Natural Interactions & Computing for Global Development at MSR India
2016-08-16On Queues and Numbers
2016-08-16PAC-Bayesian Machine Learning: Learning by Optimizing a Performance Guarantee
2016-08-16Mini Talks on Multimedia, Interaction and Communication
2016-08-16Personal Health Information Management: More than just access to health records
2016-08-16iSAX 2.0: Indexing and Mining One Billion Time Series; Database Cracking
2016-08-16Piecewise Bounds for Estimating Bernoulli-Logistic Latent Gaussian Models
2016-08-16Topological quantum computing with Majorana Fermions
2016-08-16Building a global phenology model
2016-08-16Can you read this? If you cannot read, we can help you. If you can read, you can help us.
2016-08-16Surface Haptics: Virtual Touch on Physical Surfaces
2016-08-16TouchDevelop for Windows Phone: Write code for your phone on your phone!
2016-08-16Living in a de-material world: The design and maintenance of sustainable social networks
2016-08-16Metric and Ultrametric Modelling of Semantics and Change for Decision Making
2016-08-16Day 1 recap and discussion
2016-08-16Invited Demonstrations (Session II; 10 minutes/demonstrator) Mercé Crosas



Tags:
microsoft research