===================================================================== TAC KBP 2015 EVENT ARGUMENT EXTRACTION AND LINKING EVALUATION RESULTS ===================================================================== Team ID: OSU Organization: Oregon State University Run ID: OSU1 Did the run access the live Web during the evaluation window: No Did the run perform any cross-sentence reasoning: No Did the run use any distributed representations (e.g., of words): No Did the run return meaningful confidence values: No ************************************************************* ### The following are scores for from the TAC 2015 Event Argument and Linking Evaluation. ### For all scoring breakdowns, the summaries report: Precision, Recall, F1, EAArg Score, and Overall score. ### Details of the scoring and the scoring software can be found on the TAC 2015 EAL webpage. ### ### Scores are reported on the full data set (all_genre) and broken down by genre-- discussion forum only(df) newswire only(nw). ### ### The official score (withRealis) incorporates the correctness of the (ACTUAL, GENERIC, and OTHER) distinction ### and the correctness of canonical argument string resolution. As a diagnostic, we also report (a) a score ### that ignores the realis distinction (neutralizeRealis) and (b) a score that ignores both the realis distinction ### and canonical argument string resolution(neutraliseRealisCoref). ### ### Scores are reported over two data sets. Dataset1 (all_event_types), consists of 81 documents assessed for the ### full TAC EAL event taxonomy as specified in the 2015 evaluation plan. Dataset 2(restricted_event_types), ### consists of 201 documents assessed for only 6 event types (assertions outside of the 6 were ignored). Dataset2 ### includes the documents in Dataset1. Dataset 2 was assessed to allow a more in depth evaluation of event-specific ### performance (and variance across performance by event type). The 6 event types included in Dataset2 are: ### - Transaction.Transfer-Money ### - Movement.Transport-Artifact ### - Life.Marry ### - Contact.Meet ### - Conflict.Demonstrate ### - Conflict.Attack ### ### One participant (ZJU) submitted an submission an offset error. This system output was automatically fixed by BBN (the organizer) and ### the system by ZJU (the participant). Because the modifications were different, both numbers are reported. ### ### One participant (ver-CMU) participated in "verification" version of the task. This system took as its input all ### other system submissions. This submission included the ZJU submission which had broken offsets and ### did not include either BBN's fix or ZJU's fix. Thus it is not comparable to the other systems in task performed. ### ### The LDC submission was produced with an LDC annotator spending 45-60 minutes on the task of extracting arguments ### and grouping them. The low recall of the LDC submission is due at least in part to the time limitation. ### ### While all scores provide interesting diagnostic information, the "official" evaluation metric is Dataset1(all_event_types) on the ### both genres (all_genre) using the official(withRealis) metric. #################################### ###### All Event Types ###### ####### Genre: all_genre ####### ##### Scoring Configuration: withRealis ##### submission P R F1 EAArg EALink Overall OSU1 24.0 14.8 18.3 6.0 5.4 5.7 ##### Scoring Configuration: neutralizeRealisCoref ##### submission P R F1 EAArg EALink Overall OSU1 45.3 27.9 34.5 19.6 13.7 16.6 ##### Scoring Configuration: neutralizeRealis ##### submission P R F1 EAArg EALink Overall OSU1 28.8 18.4 22.5 8.3 6.3 7.3 ####### Genre: df ####### ##### Scoring Configuration: withRealis ##### submission P R F1 EAArg EALink Overall OSU1 27.9 15.4 19.8 7.3 6.8 7.1 ##### Scoring Configuration: neutralizeRealisCoref ##### submission P R F1 EAArg EALink Overall OSU1 46.5 25.1 32.6 18.4 13.9 16.1 ##### Scoring Configuration: neutralizeRealis ##### submission P R F1 EAArg EALink Overall OSU1 32.1 18.0 23.1 9.4 7.2 8.3 ####### Genre: nw ####### ##### Scoring Configuration: withRealis ##### submission P R F1 EAArg EALink Overall OSU1 21.9 14.5 17.4 5.2 4.5 4.8 ##### Scoring Configuration: neutralizeRealisCoref ##### submission P R F1 EAArg EALink Overall OSU1 44.7 29.5 35.5 20.4 13.5 17.0 ##### Scoring Configuration: neutralizeRealis ##### submission P R F1 EAArg EALink Overall OSU1 27.1 18.6 22.1 7.5 5.8 6.7 #################################### ###### Restricted Event Types ###### ####### Genre: all_genre ####### ##### Scoring Configuration: withRealis ##### submission P R F1 EAArg EALink Overall OSU1 19.3 11.7 14.6 5.9 3.6 4.7 ##### Scoring Configuration: neutralizeRealisCoref ##### submission P R F1 EAArg EALink Overall OSU1 35.9 22.0 27.3 15.4 8.5 11.9 ##### Scoring Configuration: neutralizeRealis ##### submission P R F1 EAArg EALink Overall OSU1 24.0 15.0 18.5 7.8 4.3 6.1 ####### Genre: df ####### ##### Scoring Configuration: withRealis ##### submission P R F1 EAArg EALink Overall OSU1 19.2 10.9 13.9 5.7 3.0 4.3 ##### Scoring Configuration: neutralizeRealisCoref ##### submission P R F1 EAArg EALink Overall OSU1 36.6 20.8 26.5 15.1 7.8 11.5 ##### Scoring Configuration: neutralizeRealis ##### submission P R F1 EAArg EALink Overall OSU1 25.2 14.7 18.6 8.3 3.6 6.0 ####### Genre: nw ####### ##### Scoring Configuration: withRealis ##### submission P R F1 EAArg EALink Overall OSU1 19.3 12.3 15.0 6.1 4.0 5.0 ##### Scoring Configuration: neutralizeRealisCoref ##### submission P R F1 EAArg EALink Overall OSU1 35.4 22.9 27.8 15.6 9.0 12.3 ##### Scoring Configuration: neutralizeRealis ##### submission P R F1 EAArg EALink Overall OSU1 23.0 15.2 18.3 7.4 4.9 6.2