AAAI 2013 Tutorial on Textual Entailment

Sebastian Pado, Heidelberg University
Rui Wang, German Research Center for Artificial Intelligence

Introduction

The ability to draw inferences is a central part of language understanding. Traditionally, it has predominantly been approached through the development of formal representa- tions with provably complete and correct reasoning mechanisms. Textual Inference is a recent alternative approach that defines inference as a binary relation between two natural language text fragments, avoiding commitment to any specific knowledge repre- sentation, or reasoning mechanism. It hopes to establish a level playing field to compare and combine various approaches to inference and to establishing a task-independent paradigm for applied semantics. The proposed tutorial provides an introduction to Tex- tual Inference, spanning the range from fundamental to applied aspects. It would cover (a) definition and motivation; (b) major relevant linguistic phenomena; (c) methods for acquiring and applying linguistic knowledge for modeling inference; (d), major families 1 of algorithms; and (e) an introduction to practical system building using a new open and modular software platform.

Syllabus

Part 1. Basics: Inference and Textual Inference

Discuss the importance of inference for AI and Natural Language Processing;
Motivate Textual Inference (TI) as a probabilistic concept of inference on natural language;
Show the mapping of major NLP tasks (such as Question Answering and Machine Translation Evaluation) onto Textual Inference;
Discuss the relationship between TI and related tasks (e.g., paraphrasing, contradiction recognition);
Describe RTE (“Recognizing Textual Entailment”), the main forum for TI evaluation in the NLP community.

Part 2. Classes of Strategies and Learning

Characterize the major classes of strategies to approach TI;
Cover typical linguistic preprocessing steps;
Define representations/data structures typically used;
Provide technical detail on state-of-the-art inference procedures and machine learning techniques for the two most practically relevant classes of strategies (transformation-based and classification-based TI).

Part 3. Knowledge Acquisition

Discuss the importance of background knowledge in TI;
Identify knowledge resources used in current TI systems and their limitations;
Provide an overview of (different types of) knowledge acquisition approaches;
Define suitable representations and algorithms for using knowledge, including context-sensitive knowledge application.

Part 4. Applications

Outline main classes of NLP problems and their mapping into Textual Inference;
Go into technical detail for two applications:
Machine Translation Evaluation;
Hierarchical Information Exploration with Entailment Graphs.

Part 5. Multilingual, Component-based System Building

Describe the problems for evaluation and practical application raised by the predominance of research prototype systems in the TI area;
Present the component-based platform developed in the EXCITEMENT project (http://www.excitement-project.eu);
System demonstration.

Slides for Download

All slides are landscape, printed 2-up on a portrait page (A4).

Slides for Part 1 (2 MB, updated 7/12/13)
Slides for Part 2 (2 MB, updated 7/12/13)
Slides for Part 3 (2 MB, updated 7/12/13)
Slides for Part 4 (2 MB, updated 7/12/13)
Slides for Part 5 (2 MB, updated 7/12/13)
All slides in one file (17 MB, 7/12/13)

Sebastian Padó