Joint Policy-Value Learning for Recommendation | AISC

Published on ● Video Link: https://www.youtube.com/watch?v=fMwxxbcXk8c



Duration: 52:33
233 views
8


For slides and more information on the paper, visit https://ai.science/e/dual-bandit-rec-sys-joint-policy-value-learning-for-recommendation--8Psibjssw4O3GBD5oK2u

Speaker: Olivier Jeunen; Discussion Facilitator: Omar Nada, Susan Shu Chang

Motivation:
Beating offline metrics in Recommender System is challenging but the real question would be how effective is the model in online metrics. This paper utilizes logged data from a model to come up with a higher online evaluation scores




Other Videos By LLMs Explained - Aggregate Intellect - AI.SCIENCE


2020-08-19Review Nuggets - Mining Insight from Consumer Product Reviews | Workshop Capstone
2020-08-19Fast Film - Emotionally Aware Movie Recommender | Workshop Capstone
2020-08-19Acetock - Stock Prediction Tool for Amateur Investors | Workshop Capstone
2020-08-19Saramsh - Patent Document Summarization using BART | Workshop Capstone
2020-08-19MindfulZen - Data Driven Stress Buster | Workshop Capstone
2020-08-14Machine Learning and the Earth: Applying AI to address some of the world’s greatest challenges
2020-08-13Xun Wang (GEICO): 7 Job Profiles to Demystify the Data Science Career Landscape
2020-08-12Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning | AISC
2020-08-12Computer v.s. Human visual system | AISC
2020-08-12AI Fariness and Adversarial Debiasing
2020-08-11Joint Policy-Value Learning for Recommendation | AISC
2020-08-11Operationalizing the AI Canvas for AI Product Success (and profit) | AISC
2020-08-07Overview of Bias and Fairness in AI
2020-08-06Subexponential-Time Algorithms for Sparse PCA | AISC
2020-08-05Inverse design of nanoporous crystalline reticular materials with deep generative models | AISC
2020-08-04ChemOS: An orchestration software to democratize autonomous discovery | AISC
2020-07-30Recurrent Neural Network for Quantum Wave Function | AISC
2020-07-30Bounded Rationality in Las Vegas: Probabilistic Finite Automata PlayMulti-Armed Bandits | AISC
2020-07-30Information Retrieval for Price Consistency Monitoring - Liu Yang (Amazon)
2020-07-29Quantum Technologies: State of Play | AISC
2020-07-29NLP on Noisy User-generated text - NER for StackOverflow | AISC