Research talk: SPTAG++: Fast hundreds of billions-scale vector search with millisecond response time

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=1wTb7gdiMsE



Duration: 10:09
544 views
0


Speaker: Qi Chen, Senior Researcher, Microsoft Research Asia

Current state-of-the-art vector approximate nearest neighbor search (ANNS) libraries mainly focus on how to do fast high-recall search in memory. However, extremely large-scale vector search scenarios present certain challenges. For example, hundreds of billions of vectors coupled with limited memory creates a capacity issue. There is also a scalability issue because increasing the number of serving machines increases query latency and computation costs. This occurs as a result of the search being done in each machine, and latency increases with the increased number of aggregating candidates. To address these challenges, we propose SPTAG++, a distributed ANNS system. In this talk, we’ll discuss SPTAG++, which is now integrated into production to support hundreds of billions-scale vector searches in production with millisecond response time and more than ten thousand queries per second.

Learn more about the 2021 Microsoft Research Summit: https://Aka.ms/researchsummit




Other Videos By Microsoft Research


2022-01-24Talk: Project Dexter: Machine learning and automatic decision-making for robotic manipulation
2022-01-24Lightning talks: Gaming and Entertainment: Content creation at scale
2022-01-24Research talk: Evaluating human-like navigation in 3D video games
2022-01-24Fireside chat: Opportunities and challenges in human-oriented AI
2022-01-24Plenary: Statistical Imaginaries: An Ode to Responsible Data Science
2022-01-24Fireside chat: Smart network pipes unleashing new opportunities
2022-01-24Closing Remarks: Reinforcement Learning
2022-01-24Keynote: The Future: Converging the Cloud & Telecommunications Infrastructures
2022-01-24Research talk: Capturing the visual evolution of fashion in space and time
2022-01-24Plenary: New Developments in Human-Computer Interaction
2022-01-24Research talk: SPTAG++: Fast hundreds of billions-scale vector search with millisecond response time
2022-01-24Research talk: Attentive knowledge-aware graph neural networks for recommendation
2022-01-24Practical tips for productivity & wellbeing: Lessons from COVID-19 around time management
2022-01-24Tutorial, Research talk, and Q&A: ElectionGuard: Enabling voters to verify election integrity
2022-01-24Panel: Causal ML at Microsoft
2022-01-24Panel: Computer vision in the next decade: Deeper or broader
2022-01-24Panel: Perspectives on the new future of hybrid meetings
2022-01-24Panel: Experiments, models, inference and algorithms: Learning from experts who do it all
2022-01-24Panel: Characteristics, learnings, and challenges of thriving organizations
2022-01-24Panel: Developer velocity and productivity
2022-01-24Panel: Causal ML in industry



Tags:
Search & information retrieval
Microsoft Search
Productivity
search
recommendation
future of search
microsoft research summit