Natural Language Processing. Dependency Parsing

Similar documents
LR(1) Parsers Part III Last Parser Lecture. Copyright 2010, Keith D. Cooper & Linda Torczon, all rights reserved.

Rational Parsing. John Hale

Situierte Generierung

Lesson Plan. Integrated Approach. Day 4: Apply. Day 1: Warm-Up. Day 3: Practice. Day 2: Teach. Day 5: Assess

Annotating, Extracting, and Linking Legal Information

5 Tell students that they re going to read a text. 7 Ask students to carefully read the sentences in

Information Extraction slides adapted from Jim Martin s Natural Language Processing class

A Review of Airport Runway Scheduling

STANDARDS MAP Basic Programs 1 and 2 English Language Arts Content Standards Grade Five

CSCI 5832 Natural Language Processing

Kurt Winkler 1, Tobias Kuhn 2, Martin Volk 3

An Analytical Approach to the BFS vs. DFS Algorithm Selection Problem 1

Daily Estimation of Passenger Flow in Large and Complicated Urban Railway Network. Shuichi Myojo. Railway Technical Research Institute, Tokyo, Japan

Making the most of school-level per-student spending data

30 Day Intensive Course. Unit 2 At the Airport. Now you can look at the Key Words while the teacher explains them.

Performance and Efficiency Evaluation of Airports. The Balance Between DEA and MCDA Tools. J.Braz, E.Baltazar, J.Jardim, J.Silva, M.

Knowlywood: Mining Activity Knowledge from Hollywood Narratives

Beyond tagging: segmentation+labeling tasks. Intro to NLP - ETHZ - 25/03/2013

PBN AIRSPACE CONCEPT WORKSHOP. SIDs/STARs/HOLDS. Continuous Descent Operations (CDO) ICAO Doc 9931

(12) Patent Application Publication (10) Pub. No.: US 2005/ A1

AN ANALYSIS OF THEME AND RHEME USED IN THE FIVE-STAR HOTELS WEB SITES IN SURABAYA

Solving Clustered Oversubscription Problems for Planning e-courses

SFL/METU Fall 2017 DBE WHO 3 NLL ELE Unit 5. ELEMENTARY GROUP (Student s Copy)

Activity Template. Drexel-SDP GK-12 ACTIVITY. Subject Area(s): Sound Associated Unit: Associated Lesson: None

Scalable Runtime Support for Data-Intensive Applications on the Single-Chip Cloud Computer

CS229: AUTUMN Application of Machine Learning Algorithms to Predict Flight Arrival Delays

Weight and Balance User Guide

1 Adda247 No. 1 APP for Banking & SSC Preparation Website:store.adda247.com

NATIONAL AIRSPACE POLICY OF NEW ZEALAND

Mathcad Prime 3.0. Curriculum Guide

CASM electric cylinders The modular electric cylinder system

Appendix B Ultimate Airport Capacity and Delay Simulation Modeling Analysis

Aaron Marcus and Associates, Inc Euclid Avenue, Suite 1F Berkeley, CA , USA

SFL/METU Fall 2017 DBE WHO 3 NLL ELE Unit 5. ELEMENTARY GROUP (Instructor s Copy)

Lesson 82: Air Transport (20-25 minutes)

Croat and Muslim politicians in Mostar continue scheming to ruin the dream of a reunified Bosnia.

THE DEPARTMENT OF HIGHER EDUCATION UNIVERSITY OF COMPUTER STUDIES FIFTH YEAR

City of College Park s Update to the Zoning Ordinance

o " tar get v moving moving &

Air Traffic Information System

National Wilderness Steering Committee

ROLLER COASTER POLYNOMIALS Due: Thursday, March 30th

CONNECT Events: Flight Optimization

Baggage Check Weighing Machines. Service Overview. Prepare for Check In

IRISH AVIATION AUTHORITY DUBLIN POINT MERGE. Presented by James O Sullivan PANS-OPS & AIRSPACE INSPECTOR Irish Aviation Authority

CASM electric cylinders

AMUSEMENT MACHINES CONTRACT. Part 1 which set outs out key details relating to the contract;

Analysis of Operational Impacts of Continuous Descent Arrivals (CDA) using runwaysimulator

San Ġorġ Preca College Half Yearly Exams Year 5 English Time: 1h 15 min Reading Comprehension, Language and Writing

ROLLER COASTER POLYNOMIALS

Integrated Optimization of Arrival, Departure, and Surface Operations

Alternative Assessment Parts of Speech Create a Story

SIMULATION OF BOSNIA AND HERZEGOVINA AIRSPACE

PBN and airspace concept

ENAIRE DEVELOPMENTS NM B2B WEB SERVICES TECHNICAL FORUM. enaire.es. Network Manager nominated by the European Commission

Assignment of Arrival Slots

Aviation Trends. Quarter Contents

CURRENT SHORT-RANGE TRANSIT PLANNING PRACTICE. 1. SRTP -- Definition & Introduction 2. Measures and Standards

Preparatory Course in Business (RMIT) SIM Global Education. Bachelor of Applied Science (Aviation) (Top-Up) RMIT University, Australia

Clustering radar tracks to evaluate efficiency indicators Roland Winkler Annette Temme, Christoph Bösel, Rudolf Kruse

Grammar and Punctuation. Focus texts. Carol Matchett. For use with Schofield & Sims Grammar 5 Teacher s Guide Schofield & Sims Ltd, 2017.

Aviation Trends Quarter

Math 3 Polynomials Project

1224 Splitter and CTO combo, setup instructions using the Panelview HMI

FOR SMALL AND MEDIUM SIZED AIRPORTS Velocity FIDS

Title ID Number Sequence and Duration. Age Level Essential Question Learning Objectives

2. (1pt) From an aircraft, how can you tell the difference between a snowfield and a snow-covered glacier?

Schedule Compression by Fair Allocation Methods

Aeronautics Math. Douglas Anderson Arellanes Junior High School Santa Maria-Bonita School District

Development of Flight Inefficiency Metrics for Environmental Performance Assessment of ATM

A Multi-Agent Microsimulation Model of Toronto Pearson International Airport

Implementing an Air Taxi System

Attachment C: 2017/2018 Halifax Transit Year End Performance Report. 2017/2018 Year End Performance Measures Report

Pre-lab questions: Physics 1AL CONSERVATION OF MOMENTUM Spring Introduction

Surveillance and Broadcast Services

WELCOME!!!! IA & IB ENGLISH LANGUAGE SCHOOL YEAR ENTRY UNIT. website:

Network Revenue Management

Genetic Algorithms Applied to Airport Ground Traffic Optimization

Orientation and Conferencing Plan

Network Safeworking Rules and Procedures

(DRAFT) AFI REDUCED VERTICAL SEPARATION MINIMUM (RVSM) RVSM SAFETY POLICY

1 Introduction 2 2 Acknowledgements 2 3 Differences between Green Star SA rating tools 2 4 About the Calculator 2 5 How to Use the Calculator 2

The type rating of test pilots having flown the aircraft for its development and certification needs to be addressed as a special case.

AQME 10 System Description

FAA SAFO Turbojet Braking Performance on Wet Runway SAPOE Recommendations

Digital twin for life predictions in civil aerospace

Lecture 2: Image Classification pipeline. Fei-Fei Li & Andrej Karpathy Lecture 2-1

International Conference on Integrated Modular Avionics Moscow

6.0 JET ENGINE WAKE AND NOISE DATA. 6.2 Airport and Community Noise

Runway Length Analysis Prescott Municipal Airport

INTERNATIONAL CIVIL AVIATION ORGANIZATION WESTERN AND CENTRAL AFRICA OFFICE. Thirteenth Meeting of the FANS I/A Interoperability Team (SAT/FIT/13)

Worksheet: Resolving Trail Use(r) Conflict March 27, 2010

CHAPTER 4: PERFORMANCE

Traffic Flow Management

Consider the following: Do you travel on low cost airlines? What is your favorite airline? Are the tickets expensive? Do you get food and movies?

UNIT 3. Τhrills and Spills! Look at the picture on this page. Why do you think the Unit has the title 'Thrills and Spills'? Discuss as a class.

EN-024 A Simulation Study on a Method of Departure Taxi Scheduling at Haneda Airport

Short-Haul Operations Route Support Scheme (RSS)

A Turing Machine In Conway's Game Life. Paul Rendell

Tech Trends That Are Defining The Future Of Travel

Transcription:

Natural Language Processing Dependency Parsing

Dependency grammar The term dependency grammar does not refer to a specific grammar formalism. Rather, it refers to a specific way to describe the syntactic structure of a sentence.

Dependency grammar The notion of dependency The basic observation behind constituency is that groups of words may act as one unit. Example: noun phrase, prepositional phrase The basic observation behind dependency is that words have grammatical functions with respect to other words in the sentence. Example: subject, modifier

Dependency grammar Phrase structure trees S NP VP Pro Verb NP booked Det Nom a Nom PP Noun flight

Dependency grammar Dependency trees! dobj! subj det pmod! booked a flight! n an arc h d, the word h is called the head, and the word d is called the dependent. The arcs form a rooted tree.

Dependency grammar The history of dependency grammar The notion of dependency can be found in some of the earliest formal grammars. Modern dependency grammar is attributed to Lucien Tesnière (1893 1954). Recent years have seen a revived interest in dependency-based description of natural language syntax.

Dependency grammar Linguistic resources Descriptive dependency grammars exist for some natural languages. Dependency treebanks exist for a wide range of natural languages. These treebanks can be used to train accurate and efficient dependency parsers.

Ambiguity Just like phrase structure parsing, dependency parsing has to deal with ambiguity. dobj subj det pmod booked a flight

Ambiguity Just like phrase structure parsing, dependency parsing has to deal with ambiguity. dobj pmod subj det booked a flight

Disambiguation We need to disambiguate between alternative analyses. We develop mechanisms for scoring dependency trees, and disambiguate by choosing a dependency tree with the highest score.

Scoring models and parsing algorithms Distinguish two aspects: Scoring model: How do we want to score dependency trees? Parsing algorithm: How do we compute a highest-scoring dependency tree under the given scoring model?

The arc-factored model To score a dependency tree, score the individual arcs, and combine the score into a simple sum. score(t) = score(a1) + + score(an) Define the score of an arc h d as the weighted sum of all features of that arc: score(h d) = f1w1 + + fnwn

Arc-factored dependency parsing Examples of features The head is a verb. The dependent is a noun. The head is a verb and the dependent is a noun. The head is a verb and the predecessor of the head is a pronoun. The arc goes from left to right. The arc has length 2.

Arc-factored dependency parsing Training using structured prediction Take a sentence w and a gold-standard dependency tree g for w. Compute the highest-scoring dependency tree under the current weights; call it p. ncrease the weights of all features that are in g but not in p. Decrease the weights of all features that are in p but not in g.

Arc-factored dependency parsing Parsing algorithms Collins algorithm: Straightforward adaptation of CKY to dependency trees. Runs in O(w 5 ) time. Eisner s algorithm: mproves complexity by building the left and right halves of trees independently. Runs in O(w 3 ) time.

Natural Language Processing Transition-Based Dependency Parsing

Transition-based dependency parsing Eisner s algorithm runs in time O( w 3 ). This may be too much if a lot of data is involved. dea: Design a dumber but really fast algorithm and let the machine learning do the rest. Eisner s algorithm searches over many different dependency trees at the same time. A transition-based dependency parser only builds one tree, in one left-to-right sweep over the input.

Transition-based dependency parsing Transition-based dependency parsing The parser starts in an initial configuration. At each step, it asks a guide to choose between one of several transitions (actions) into new configurations. Parsing stops if the parser reaches a terminal configuration. The parser returns the dependency tree associated with the terminal configuration.

Transition-based dependency parsing Generic parsing algorithm Configuration c = parser.getnitialconfiguration(sentence)! while c is not a terminal configuration do! Transition t = guide.getnexttransition(c)! c = c.maketransition(t)! return c.getgraph()

Transition-based dependency parsing Guides We need a guide that tells us what the next transition should be. The task of the guide can be understood as classification: Predict the next transition (class), given the current configuration.

Transition-based dependency parsing Training a guide We let the parser run on gold-standard trees. Every time there is a choice to make, we simply look into the tree and do the right thing. We collect all (configuration, transition) pairs and train a classifier on them. When parsing unseen sentences, we use the trained classifier as a guide.

Transition-based dependency parsing Training a guide The number of (configuration, transition) pairs is far too large. We define a set of features of configurations that we consider to be relevant for the task of predicting the next transition. Example: word forms of the topmost two words on the stack and the next two words in the buffer We can then describe every configuration in terms of a feature vector.

Transition-based dependency parsing Training a guide configurations in which we want to do la score for feature 2 configurations in which we want to do ra score for feature 1

Transition-based dependency parsing Training a guide score for feature 2 la ra classification function learned by the classifier score for feature 1

Transition-based dependency parsing Training a guide n practical systems, we have thousands of features and hundreds of transitions. There are several machine-learning paradigms that can be used to train a guide for such a task. Examples: perceptron, decision trees, support-vector machines

The arc-standard algorithm The arc-standard algorithm is a simple algorithm for transition-based dependency parsing. t is very similar to shift reduce parsing as it is known for context-free grammars. t is implemented in most practical transitionbased dependency parsers, including MaltParser.

The arc-standard algorithm Configurations A configuration for a sentence w = w1 wn consists of three components: a buffer containing words of w a stack containing words of w the dependency graph constructed so far

The arc-standard algorithm Configurations nitial configuration: All words are in the buffer. The stack is empty. The dependency graph is empty. Terminal configuration: The buffer is empty. The stack contains a single word.

The arc-standard algorithm Possible transitions shift (sh): push the next word in the buffer onto the stack left-arc (la): add an arc from the topmost word on the stack, s1, to the second-topmost word, s2, and pop s2 right-arc (ra): add an arc from the second-topmost word on the stack, s2, to the topmost word, s1, and pop s1

The arc-standard algorithm Example run Stack Buffer booked a flight booked a flight

The arc-standard algorithm Example run Stack Buffer booked a flight booked a flight sh

The arc-standard algorithm Example run Stack Buffer booked a flight booked a flight

The arc-standard algorithm Example run Stack Buffer booked a flight booked a flight sh

The arc-standard algorithm Example run Stack Buffer booked a flight booked a flight

The arc-standard algorithm Example run Stack Buffer booked a flight booked a flight la-subj

The arc-standard algorithm Example run Stack Buffer booked a flight subj booked a flight

The arc-standard algorithm Example run Stack Buffer booked a flight subj booked a flight sh

The arc-standard algorithm Example run Stack Buffer booked a flight subj booked a flight

The arc-standard algorithm Example run Stack Buffer booked a flight subj booked a flight sh

The arc-standard algorithm Example run Stack Buffer booked a flight subj booked a flight

The arc-standard algorithm Example run Stack Buffer booked a flight subj booked a flight la-det

The arc-standard algorithm Example run Stack Buffer booked flight subj det booked a flight

The arc-standard algorithm Example run Stack Buffer booked flight subj det booked a flight sh

The arc-standard algorithm Example run Stack Buffer booked flight subj det booked a flight

The arc-standard algorithm Example run Stack Buffer booked flight subj det booked a flight ra-pmod

The arc-standard algorithm Example run Stack Buffer booked flight subj det pmod booked a flight

The arc-standard algorithm Example run Stack Buffer booked flight subj det pmod booked a flight ra-dobj

The arc-standard algorithm Example run Stack Buffer booked dobj subj det pmod booked a flight

The arc-standard algorithm Example run Stack Buffer booked dobj subj det pmod booked a flight done!