NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: Real time data...

Subscribers:
348,000
Published on ● Video Link: https://www.youtube.com/watch?v=yEI57SK0P0U



Duration: 38:06
3,293 views
16


Big Learning Workshop: Algorithms, Systems, and Tools for Learning at Scale at NIPS 2011
Invited Talk: Real time data sketches by Alex Smola

Alex is a Principal Researcher at Yahoo. Alex's current research focus is on nonparametric methods for estimation, in particular kernel methods and exponential families. This includes support vector Machines, gaussian processes, and conditional random fields.

Abstract: I will describe a set of algorithms for extending streaming and sketching algorithms to real time analytics. These algorithm captures frequency information for streams of arbitrary sequences of symbols. The algorithm uses the Count-Min sketch as its basis and exploits the fact that the sketching operation is linear. It provides real time statistics of arbitrary events, e.g.\ streams of queries as a function of time. In particular, we use a factorizing approximation to provide point estimates at arbitrary (time, item) combinations. The service runs in real time, it scales perfectly in terms of throughput and accuracy, using distributed hashing. The latter also provides performance guarantees in the case of machine failure. Queries can be answered in constant time regardless of the amount of data to be processed. The same distribution techniques can also be used for heavy hitter detection in a distributed scalable fashion.




Other Videos By Google TechTalks


2012-02-23NIPS 2011 Sparse Representation & Low-rank Approximation Workshop: Group Sparse Hidden Markov...
2012-02-23NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: A Common GPU...
2012-02-23The Relative Happiness Index (RHI)
2012-02-23A Chinese Typewriter in Silicon Valley
2012-02-233D Computer Vision: Past, Present, and Future
2012-02-20Knowledge is... Love
2012-02-16Meditate with Father Laurence Freeman
2012-02-14Agile C++ with Supporting Eclipse CDT Plug-ins
2012-02-14Santa Tracker - 1.6 Million Requests per Second
2012-02-13NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: Spark: In-Memory Cluster...
2012-02-13NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: Real time data...
2012-02-13NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: Hazy - Making Data-driven...
2012-02-13NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: Block splitting for...
2012-02-13NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: No-U-Turn Sampler...
2012-02-13NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: Graphlab 2...
2012-02-13NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: Graphlab 2 Tutorial
2012-02-13NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: Large-Scale Matrix...
2012-02-13NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: Randomized Smoothing for...
2012-02-13NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: Machine Learning's Role...
2012-02-13NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: Fast Cross-Validation...
2012-02-13NIPS 2011 Big Learning - Algorithms, Systems, & Tools Workshop: High-Performance Computing...



Tags:
new
bigml
d2
smola