TAC 2015 Tracks
    Cold Start KBP
    Tri-Lingual EDL
    Event
    Validation/Ensembling
Data
Schedule
Organizers
Track Registration
Reporting Guidelines
TAC 2015 Workshop

TAC Knowledge Base Population (KBP) 2015

Evaluation: February-November, 2015
Workshop: November 16-17, 2015

Conducted by:
U.S. National Institute of Standards and Technology (NIST)

With support from:
U.S. Department of Defense

Overview

The Text Analysis Conference (TAC) is a series of evaluation workshops organized to encourage research in Natural Language Processing and related applications, by providing a large test collection, common evaluation procedures, and a forum for organizations to share their results. TAC comprises sets of tasks known as "tracks," each of which focuses on a particular subproblem of NLP. TAC tracks focus on end-user tasks, but also include component evaluations situated within the context of end-user tasks.

The goal of TAC Knowledge Base Population (KBP) is to develop and evaluate technologies for populating knowledge bases (KBs) from unstructured text.

The Cold Start KBP task is to build a KB from scratch, using a predefined KB schema and a collection of unstructured text. The current KB schema consists of named entities that can be a person (PER), organization (ORG), or geopolitical entity (GPE); and predefined attributes (a.k.a "slots") for those named entities. In addition to end-to-end KB construction, the Cold Start track offers two diagnostic tasks: Entity Discovery (ED) and Slot Filling (SF). The Cold Start entity discovery task is to create a KB node for each PER, ORG, and GPE entity mentioned by name in the Cold Start collection of text, and to link all named mentions of the entity to its KB node. The Cold Start slot filling task is to search the same document collection to fill in values for specific slots for specific entities.

The Tri-lingual Entity Discovery and Linking (EDL) track aims to extract entity mentions from a source collection of textual documents in multiple languages (English, Chinese and Spanish), and link them to an existing Knowledge Base (KB). An EDL system is also required to cluster mentions for those NIL entities that don't have corresponding KB entries.

The event track in KBP 2015 aims to extract information about events from unstructured text, such that the information would be suitable as input into a structured KB.

Finally, the slot filler validation task in the validation/ensembling track focuses on the refinement of output from slot filling systems by either combining information from multiple slot filling systems, or applying more intensive linguistic processing to validate candidate slot fillers.

Tracks

Cold Start KBP Track
The Cold Start KBP track builds a knowledge base from scratch using a given document collection and a predefined schema for the entities and relations that will comprise the KB. Cold Start KBP includes an Entity Discovery (ED) task to discover entities and cluster their named mentions, and a Slot Filling (SF) task to fill in values for predefined slots (attributes) for a given entity.
Track home page: http://tac.nist.gov/2015/KBP/ColdStart/
Track coordinators: James Mayfield ([email protected]) and Ralph Grishman ([email protected])

Tri-Lingual Entity Discovery and Linking Track (EDL)
The Tri-Lingual Entity Discovery and Linking (EDL) track aims to extract entity mentions from a source collection of textual documents in multiple languages (English, Chinese, and Spanish), and link them to an existing Knowledge Base (KB); an EDL system is also required to cluster mentions for those entities that don't have corresponding KB entries.
Track home page: http://nlp.cs.rpi.edu/kbp/2015/
Track coordinator: Heng Ji ([email protected])

Event Track
The goal of the Event track is to extract information about events such that the information would be suitable as input to a knowledge base. The track includes Event Nugget (EN) tasks to detect and link events, and Event Argument (EA) tasks to extract event arguments and link arguments that belong to the same event.
Track home page: http://tac.nist.gov/2015/KBP/Event/
Event Nugget coordinators: Eduard Hovy ([email protected]) and Teruko Mitamura ([email protected])
Event Argument coordinator: Marjorie Freedman ([email protected])

Validation/Ensembling Track
The Validation/Ensembling track focuses on the refinement of output from slot filling systems by either combining information from multiple slot filling systems, or applying more intensive linguistic processing to validate individual candidate slot fillers.
Track home page: http://tac.nist.gov/2015/KBP/SFValidation/
Track coordinator: Hoa Dang ([email protected])

Preliminary Schedule

Preliminary TAC KBP 2015 Schedule
April 15	Track registration opens
Mid April	Initial track guidelines posted
June 30	Deadline for registration for track participation
August - October	Track evaluation windows (varies by track)
August 3-31	Cold Start KBP evaluation window
August 17-24	Event (Argument Extraction and Linking) evaluation window
August 31-Sept 7	Event (Argument Verification and Linking) evaluation window
Sept 8-21	Event (Nugget Detection) evaluation window
Sept 8-21	Event (Nugget Detection and Coreference) evaluation window
Sept 22-29	Event (Nugget Coreference) evaluation window
Sept 21-Oct 12	Slot Filler Validation/Ensembling evaluation window
Sept 28-Oct 12	Tri-Lingual EDL evaluation window
Oct 14-19	Diagnostic Tri-Lingual EL evaluation window
By mid October	Release of individual evaluated results to participants (most tracks)
October 20	Deadline for short system descriptions
October 20	Deadline for workshop presentation proposals
October 25	Notification of acceptance of presentation proposals
Nov 8	Deadline for system reports (workshop notebook version)
November 16-17	TAC 2015 workshop in Gaithersburg, Maryland, USA
March 5, 2016	Deadline for system reports (final proceedings version)

Organizing Committee

Hoa Trang Dang (U.S. National Institute of Standards and Techonology)
Jason Duncan (MITRE)
Joe Ellis (Linguistic Data Consortium)
Marjorie Freedman (BBN Technologies)
Ralph Grishman (New York University)
Eduard Hovy (Carnegie Mellon University)
Heng Ji (Rensselaer Polytechnic Institute)
James Mayfield (Johns Hopkins University)
Teruko Mitamura (Carnegie Mellon University)
Boyan Onyshkevych (U.S. Department of Defense)
Shahzad Rajput (U.S. National Institute of Standards and Techonology)
Zhiyi Song (Linguistic Data Consortium)
Stephanie Strassel (Linguistic Data Consortium)