===============================================================================
TAC KBP 2016 SLOT FILLER VALIDATION CROSS-LINGUAL ENSEMBLING EVALUATION RESULTS
===============================================================================

Team ID: gator_dsr
Organization: University of Florida

*************************************************************

Run ID: gator_dsr1

Did the run access the live Web during the evaluation window: No
Did this run judge each candidate slot filler independently of all other candidate slot fillers in the evaluation dataset: No
Did this run judge candidate slot fillers for each slot-filling run independently of all other slot-filling runs in the evaluation dataset: No
Did this run judge candidate slot fillers for each slot-filling team independently of all other slot-filling teams in the evaluation dataset: No
Did this run make use of the slot filler or justification offsets provided for each candidate slot filler: Yes
Did this run make use of the confidence values provided for each candidate slot filler: Yes
Did this run make use of the system profiles for the slot filling runs: No
Did this run make use of the preliminary assessments provided for some of the slot filler candidates: No

CSSF micro-average Precision, Recall, and F1, at each hop level:

                                        hop0_P  hop0_R  hop0_F  hop1_P  hop1_R  hop1_F  ALL_P   ALL_R   ALL_F
gator_dsr1.XLING.ensemble               0.6089  0.0308  0.0587  0.7778  0.0040  0.0080  0.6170  0.0219  0.0424
hltcoe_KB_XLING_1 (best hop0 F1 input)  0.4158  0.2110  0.2800  0.1875  0.1828  0.1851  0.3045  0.2017  0.2426
hltcoe_KB_XLING_1 (best hop1 F1 input)  0.4158  0.2110  0.2800  0.1875  0.1828  0.1851  0.3045  0.2017  0.2426
hltcoe_KB_XLING_1 (best ALL F1 input)   0.4158  0.2110  0.2800  0.1875  0.1828  0.1851  0.3045  0.2017  0.2426

MAX micro-average Precision, Recall, and F1, at each hop level:

                                        hop0_P  hop0_R  hop0_F  hop1_P  hop1_R  hop1_F  ALL_P   ALL_R   ALL_F
gator_dsr1.XLING.ensemble               0.6850  0.0829  0.1478  0.8333  0.0092  0.0182  0.6917  0.0577  0.1065
hltcoe_KB_XLING_1 (best hop0 F1 input)  0.4072  0.3029  0.3474  0.1494  0.2335  0.1822  0.2728  0.2792  0.2760
hltcoe_KB_XLING_3 (best hop1 F1 input)  0.4344  0.2838  0.3433  0.2151  0.1985  0.2065  0.3418  0.2547  0.2919
hltcoe_KB_XLING_3 (best ALL F1 input)   0.4344  0.2838  0.3433  0.2151  0.1985  0.2065  0.3418  0.2547  0.2919

MEAN macro-average Precision, Recall, and F1, at each hop level:

                                        hop0_P  hop0_R  hop0_F  hop1_P  hop1_R  hop1_F  ALL_P   ALL_R   ALL_F
gator_dsr1.XLING.ensemble               0.0954  0.0784  0.0809  0.0102  0.0051  0.0058  0.0526  0.0416  0.0432
hltcoe_KB_XLING_1 (best hop0 F1 input)  0.1648  0.1588  0.1522  0.1206  0.1466  0.1246  0.1426  0.1527  0.1383
hltcoe_KB_XLING_1 (best hop1 F1 input)  0.1648  0.1588  0.1522  0.1206  0.1466  0.1246  0.1426  0.1527  0.1383
hltcoe_KB_XLING_1 (best ALL F1 input)   0.1648  0.1588  0.1522  0.1206  0.1466  0.1246  0.1426  0.1527  0.1383