Question Answering as an Automatic Evaluation Metric for News Article Summarization Matan Eyal1, 2, Tal Baumel1, 3, Michael Elhadad1 1Dept. Computer Science, Ben Gurion University 2IBM Research, Israel, 3Microsoft fmataney,
[email protected],
[email protected] Abstract See et al.(2017)’s Summary: bolton will offer new contracts to emile heskey, 37, eidur gudjohnsen, 36, and adam bogdan, 27. Recent work in the field of automatic sum- heskey and gudjohnsen joined on short-term deals in december. eidur gudjohnsen has scored five times in the championship . marization and headline generation focuses on APES score: 0.33 maximizing ROUGE scores for various news Baseline Model Summary (Encoder / Decoder / Attention / datasets. We present an alternative, extrin- Copy / Coverage): bolton will offer new contracts to emile hes- sic, evaluation metric for this task, Answering key, 37, eidur gudjohnsen, 36, and goalkeeper adam bogdan, 27. Performance for Evaluation of Summaries. heskey and gudjohnsen joined on short-term deals in december, and have helped neil lennon ’s side steer clear of relegation. ei- APES utilizes recent progress in the field of dur gudjohnsen has scored five times in the championship, as reading-comprehension to quantify the ability well as once in the cup this season . of a summary to answer a set of manually cre- APES score: 0.33 ated questions regarding central entities in the Our Model (APES optimization): bolton will offer new con- source article. We first analyze the strength tracts to emile heskey, 37, eidur gudjohnsen, 36, and goalkeeper adam bogdan, 27. heskey joined on short-term deals in decem- of this metric by comparing it to known man- ber, and have helped neil lennon ’s side steer clear of relegation.