TAC 2019 Tracks
SM-KBP
Guidelines
Data
Tools
EDL
DDI
Call for Participation
Track Registration
Reporting Guidelines
TAC 2019 Workshop
|
|
Streaming Multimedia Knowledge Base Population (SM-KBP) 2019
Evaluation: February-November, 2019
Workshop: November 12-13, 2019
Conducted by:
U.S. National Institute of Standards and Technology (NIST)
With support from:
U.S. Department of Defense
Background
In scenarios such as natural disasters or international conflicts,
analysts and the public are often confronted with a variety of
information coming through multiple media sources. There is a need for
technologies to analyze and extract knowledge from multimedia to
develop and maintain an understanding of events, situations, and
trends as they unfold around the world.
The goal of DARPA's Active Interpretation of Disparate Alternatives
(AIDA) Program is to develop a multi-hypothesis semantic engine that
generates explicit alternative interpretations of events, situations,
and trends from a variety of unstructured sources, for use in noisy,
conflicting, and potentially deceptive information environments. This
engine must be capable of mapping knowledge elements (KE)
automatically derived from multiple media sources into a common
semantic representation, aggregating information derived from those
sources, and generating and exploring multiple hypotheses about the
events, situations, and trends of interest. This engine must establish
confidence measures for the derived knowledge and hypotheses, based on
the accuracy of the analysis and the coherence of the semantic
representation of each hypothesis.
The streaming multimedia KBP track will assess the performance of
systems that have been developed in support of AIDA program goals.
Systems will be asked to extract knowledge elements from a stream of
heterogeneous documents containing multilingual multimedia sources
including text, speech, images, videos, and pdf files; aggregate the
knowledge elements from multiple documents without access to the raw
documents themselves (maintaining multiple interpretations and
confidence values for KEs extracted or inferred from the documents);
and develop semantically coherent hypotheses, each of which represents
an interpretation of the document stream.
The SM-KBP tasks will be run at TAC/TRECVID 2019. After the 2018 pilot, it is expected that the SM-KBP track will be
run for 3 evaluation cycles:
- Pilot Evaluation: September-October 2018
- Evaluation 1 (short cycle): June-August 2019
- Evaluation 2 (18-month cycle): August-September 2020
- Evaluation 3 (18-month cycle): March-April 2022
Overview
SM-KBP evaluation is over a small set of topics for a single scenario.
There will be a new scenario and related set of languages for each
evaluation cycle. For the 2018 pilot and 2019 evaluation, the scenario is the
Russian/Ukrainian conflict (2014-2015) and the scenario languages are
English, Russian, and Ukrainian. Early in the evaluation cycle, all
task participants will receive an ontology of entities, events, event
arguments, relations, and SEC (sentiment, emotion, and cognitive
state), defining the KEs that are in scope for the evaluation
tasks.
The SM-KBP track has three main evaluation tasks:
- Task 1: Extraction of KEs and KE mentions from a stream of multi-media documents, including linking of mentions of the same KE within each document to produce a document-level knowledge graph for each document. Extraction and linking will be conditioned on two kinds of contexts:
- a) generic background context
- b) generic background context plus a "what if" hypothesis
- Task 2: Construction of a KB by aggregating and linking document-level knowledge graphs produced by one or more Task 1 teams.
- Task 3: Generation of hypotheses from KBs produced by one or more Task 2 teams.
Tasks 1a and 2 are open to all researchers who find the evaluation
tasks of interest. Tasks 1b and 3 and limited to teams that are part
of DARPA's AIDA program.
The source corpus for the 2019 evaluation will comprise approximately 2000
English, Russian, and Ukrainian documents. Systems in Task 1 will
operate on the 2000 documents in the source corpus; systems in Task 2
will operate on the output of one or more systems from Task 1a and
will not have access to the source documents; systems in Task 3
will operate on the output of one or more systems from Task 2, and
also will not have access to the source documents. There are
many use cases in which analytic engines cannot have access to
original documents; for example, provenance for an assertion may have
never been recorded in the first place, or provenance may need to be
redacted for legal or security reasons.
Novel characteristics of the open evaluation tasks (Task 1a and Task 2) include:
- Task 1: Multimodal multilingual extraction and linking of information within a document
- Task 1 and 2: Processing of streaming input
- Task 1 and 2: Confidence estimation and maintenance of multiple possible interpretations
- Task 2: Cross-document aggregation and linking of information without access to original documents
Novel characteristics of the AIDA program-internal evaluation tasks (Task 1b and Task 3) include:
- Document-level extraction and linking conditioned on "feedback hypotheses" providing context.
- Generation of semantically coherent hypotheses, each representing a different interpretation of the document stream.
Schedule
TAC SM-KBP 2019 Schedule |
June 15 | Deadline for registration for track participation |
June 27 (7:00AM EDT) - July 3 (11:59PM EDT) | Task 1a Evaluation Window |
July 8 (7:00AM EDT) - July 14 (11:59PM EDT) | Task 1b Evaluation Window |
July 8 (7:00AM EDT) - July 14 (11:59PM EDT) | Task 2 Evaluation Window |
August 5 (7:00AM EDT) - August 11 (11:59PM EDT) | Task 3a and 3b Evaluation Window |
September 15 | Deadline for short system descriptions |
September 15 | Deadline for workshop presentation proposals |
Early October | Release of partial preliminary evaluated results to participants |
Early October | Notification of acceptance of presentation proposals |
November 1 | Deadline for system reports (workshop notebook version) |
November 12-13 | TAC 2019 workshop in Gaithersburg, Maryland, USA |
March 1, 2020 | Deadline for system reports (final proceedings version) |
Mailing List
Join the sm-kbp group to subscribe yourself to the sm-kbp@list.nist.gov mailing list (if not already subscribed):
Registering to participate in a track does not automatically
add you to the mailing list. If you were previously subscribed to the
mailing list, you do not have to re-subscribe (the mailing list is for
anyone interested in SM-KBP, rather than specifically for SM-KBP
participants, and thus carries over from year to year).
Organizing Committee
Hoa Trang Dang (U.S. National Institute of Standards and Techonology)
Oleg Aulov (U.S. National Institute of Standards and Techonology)
George Awad (U.S. National Institute of Standards and Techonology)
Asad Butt (U.S. National Institute of Standards and Techonology)
Shahzad Rajput (U.S. National Institute of Standards and Techonology)
Jason Duncan (MITRE)
Boyan Onyshkevych (U.S. Department of Defense)
Stephanie Strassel (Linguistic Data Consortium)
Jennifer Tracey (Linguistic Data Consortium)
|