[DOM-Q-NET] Grounded RL on Structured Language | AISC Author Speaking

Published on 2019-04-01 | Video Link: https://www.youtube.com/watch?v=T6-7ASpoUp4



Duration: 1:50:00
712 views


For more details including paper and slides, visit https://aisc.a-i.science/events/2019-04-01/

Lead/first author: Sheng Jia
Facilitator: Nicolai Pogrebnyakov

Abstract

Building agents to interact with the web would allow for significant improvements in knowledge understanding and representation learning. However, web navigation tasks are difficult for current deep reinforcement learning (RL) models due to the large discrete action space and the varying number of actions between the states. In this work, we introduce DOM-Q-NET, a novel architecture for RL-based web navigation to address both of these problems. It parametrizes Q functions with separate networks for different action categories: clicking a DOM element and typing a string input. Our model utilizes a graph neural network to represent the tree-structured HTML of a standard web page. We demonstrate the capabilities of our model on the MiniWoB environment where we can match or outperform existing work without the use of expert demonstrations. Furthermore, we show 2x improvements in sample efficiency when training in the multi-task setting, allowing our model to transfer learned behaviours across tasks.
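To make the architecture described above concrete, below is a minimal PyTorch sketch of the idea, not the authors' implementation: a graph neural network embeds every DOM element of the page, and separate Q-value heads score the two action categories (clicking a DOM element, typing a string into it). All module names, dimensions, and the toy adjacency matrix are illustrative assumptions.

# Minimal sketch of the abstract's idea (assumed PyTorch API; not the authors' code).
import torch
import torch.nn as nn


class DomGraphEncoder(nn.Module):
    """Message passing over the DOM tree, given node features and an adjacency matrix."""

    def __init__(self, in_dim: int, hid_dim: int, num_layers: int = 3):
        super().__init__()
        self.input_proj = nn.Linear(in_dim, hid_dim)
        self.layers = nn.ModuleList([nn.Linear(hid_dim, hid_dim) for _ in range(num_layers)])

    def forward(self, node_feats: torch.Tensor, adj: torch.Tensor) -> torch.Tensor:
        # node_feats: [num_nodes, in_dim]; adj: [num_nodes, num_nodes] parent/child edges.
        h = torch.relu(self.input_proj(node_feats))
        for layer in self.layers:
            h = torch.relu(layer(adj @ h) + h)  # aggregate neighbours, keep a residual
        return h  # one embedding per DOM element


class DomQNet(nn.Module):
    """Separate Q heads per action category, so the action set can vary per state."""

    def __init__(self, in_dim: int, hid_dim: int, num_tokens: int):
        super().__init__()
        self.encoder = DomGraphEncoder(in_dim, hid_dim)
        self.q_click = nn.Linear(hid_dim, 1)          # one Q value per clickable DOM node
        self.q_type = nn.Linear(hid_dim, num_tokens)  # Q values for typing candidate tokens

    def forward(self, node_feats, adj):
        h = self.encoder(node_feats, adj)
        return self.q_click(h).squeeze(-1), self.q_type(h)


# Toy usage: act greedily over both heads.
model = DomQNet(in_dim=16, hid_dim=64, num_tokens=8)
feats, adj = torch.randn(10, 16), torch.eye(10)       # dummy 10-node DOM
q_click, q_type = model(feats, adj)
best_click = q_click.argmax()                          # index of DOM element to click
best_type = divmod(q_type.argmax().item(), 8)          # (element index, token index) to type

Because every DOM node produces its own Q values, the number of available actions can differ from page to page without changing the network, which is the property the abstract highlights for handling the varying action space.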




Other Videos By LLMs Explained - Aggregate Intellect - AI.SCIENCE


2019-05-02 Revolutionary Deep Learning Method to Denoise EEG Brainwaves
2019-04-25 [LISA] Linguistically-Informed Self-Attention for Semantic Role Labeling | AISC
2019-04-23 How goodness metrics lead to undesired recommendations
2019-04-22 Deep Neural Networks for YouTube Recommendation | AISC Foundational
2019-04-18 [Phoenics] A Bayesian Optimizer for Chemistry | AISC Author Speaking
2019-04-18 Why do large batch sized trainings perform poorly in SGD? - Generalization Gap Explained | AISC
2019-04-16 Structured Neural Summarization | AISC Lunch & Learn
2019-04-11 Deep InfoMax: Learning deep representations by mutual information estimation and maximization | AISC
2019-04-08 ACT: Adaptive Computation Time for Recurrent Neural Networks | AISC
2019-04-04 [FFJORD] Free-form Continuous Dynamics for Scalable Reversible Generative Models (Part 1) | AISC
2019-04-01 [DOM-Q-NET] Grounded RL on Structured Language | AISC Author Speaking
2019-03-31 5-min [machine learning] paper challenge | AISC
2019-03-28 [Variational Autoencoder] Auto-Encoding Variational Bayes | AISC Foundational
2019-03-25 [GQN] Neural Scene Representation and Rendering | AISC
2019-03-21 Towards Interpretable Deep Neural Networks by Leveraging Adversarial Examples | AISC
2019-03-18 Understanding the Origins of Bias in Word Embeddings
2019-03-14 [Original Style Transfer] A Neural Algorithm of Artistic Style | TDLS Foundational
2019-03-11 [RecSys 2018 Challenge winner] Two-stage Model for Automatic Playlist Continuation at Scale | TDLS
2019-03-07 [OpenAI GPT2] Language Models are Unsupervised Multitask Learners | TDLS Trending Paper
2019-03-04 You May Not Need Attention | TDLS Code Review
2019-02-28 [DDQN] Deep Reinforcement Learning with Double Q-learning | TDLS Foundational



Tags:
deep learning
machine learning
gcn
deep reinforcement learning
learning representation
dom-net
dom-q-net
graph convolutional neural network