Development of Large-Scale Grammars Through Corpus Construction (Japanese Audio)

Channel:

Google TechTalks

Subscribers:

349,000

Published on November 5, 2010 6:30:44 PM ● Video Link: https://www.youtube.com/watch?v=OH53yIb8ULk

Duration: 1:11:11

4,192 views

Google Tech Talk
14:00- JST Oct 27 2010
At Google Japan (Japanese Audio)

Speaker : Yusuke Miyao (宮尾祐介)
Bio : http://www.nii.ac.jp/en/faculty/digital_content/MIYAO-Yusuke/
Affiliation : National Institute of Informatics (国立情報学研究所)

Language : spoken in Japanese and slides in English

Title : Development of large-scale grammars through corpus construction

Abstract :
A crucial bottleneck of grammar-based deep parsing is the difficulty
of the development of large-scale grammars that can analyze real-world
sentences. In our approach, the final goal of grammar development is
the construction of a treebank (parsed corpus) that conforms to a
grammar theory. Given an existing corpus (e.g. Penn Treebank) and a
grammar theory, we can construct a treebank at low cost. Since a
large-scale lexicon can be extracted automatically from the treebank,
a large-scale grammar can be developed in a short period. In this
talk, I overview our method of corpus-based grammar development, in
comparison with manual grammar development and grammar learning.

Japanese title : コーパス構築に基づく大規模文法開発

Japanese abstract :
文法に基づく深い構文解析の最大の問題点は，実世界の文を解析できる大規模
文法の実装が困難なことである．コーパス構築に基づく文法開発手法では，文
法開発の最終目的を，文法理論に基づくツリーバンク（解析済みコーパス）の
構築と考える．既存のコーパス(Penn Treebank など)と文法理論を利用すると，
ツリーバンクは比較的低コストで構築することができる．すると，大規模辞書
はツリーバンクから自動獲得できるので，大規模文法を短期間で開発すること
が可能となった．本トークでは，人手による文法開発や文法学習と対比しなが
ら，コーパス構築に基づく文法開発手法を概説する．

Other Videos By Google TechTalks

2010-12-01	GTAC 2010: The Future of Front-End Testing
2010-12-01	GTAC 2010: Opening Remarks
2010-11-30	Heavy Ion Fusion
2010-11-29	A Vision For the Science of Imagination
2010-11-24	Improved Code Clone Categorization
2010-11-24	Mumbai Rising? India's Economic Rise and the United States
2010-11-22	Wokai: Microfinance and the Future of China
2010-11-22	Breaking Barriers with Sound (Ge Wang)
2010-11-16	Mirah, an Expressive JVM Language
2010-11-10	Getting Serious Games into the K-16 Classroom (Victoria Van Voorhis)
2010-11-05	Development of Large-Scale Grammars Through Corpus Construction (Japanese Audio)
2010-11-04	It Takes Two to Tango: The Human Future and the Future of Buddhism
2010-11-01	Fun is the Future: Mastering Gamification
2010-10-29	確率密度比を用いた新しい機械学習アルゴリズム
2010-10-29	Google Workshop on Quantum Biology: Welcome and Introduction
2010-10-28	Learning From Examples Using Quantum Annealing (Google Workshop on Quantum Biology)
2010-10-28	Electrodynamic Signaling by the Dendritic Cytoskeleton (Google Workshop on Quantum Biology)
2010-10-28	Clarifying the Tubulin bit/qubit - Defending the Penrose-Hameroff Orch OR Model (Quantum Biology)
2010-10-28	Experimental Studies on a Single Microtubule (Google Workshop on Quantum Biology)
2010-10-28	Microtubules - Electric Oscillating Structures in Living Cells (Google Workshop on Quantum Biology)
2010-10-28	Classical and Quantum Information in DNA (Google Workshop on Quantum Biology)

Tags:

google tech talk

computational linguistics

machine learning

Channel	Latest
Mr. Souer	6 hours ago
Obake PAM Ch.	6 hours ago
Shourize Hobby	6 hours ago
DARK Gaming	6 hours ago
KABEGON JAPAN	6 hours ago
せしるおじさん	6 hours ago
Cartoon Freak #	6 hours ago
PUBG: BATTLEGROUNDS INDONESIA	6 hours ago
Munam Aslam	6 hours ago
Yudi Syahputra	6 hours ago
TheOnlyAlphaGamer	6 hours ago
StephanZA	6 hours ago
MURASAKI 夢羅佐希 GAME日記	6 hours ago
Julius Preset • 37 rb x ditonton • 5 jam yang lalu	6 hours ago
Microboy	6 hours ago
Dialga22239	6 hours ago
GameXnews	6 hours ago
GB GAMER	7 hours ago
香口Karl	7 hours ago
realme Indonesia	7 hours ago
Ryuk Leonidas	7 hours ago
Gaming Raju	7 hours ago
BKCG gaming	7 hours ago
Avinash Gaming Official	7 hours ago
Kir Lucky	7 hours ago