BLEU

|

BLEU was one of the first metrics to report high correlation with human judgements of quality. The metric is currently one of the most popular in the field. The central idea behind the metric is that “the closer a machine … Read More

Automatic Language Processing Advisory Committee (ALPAC)

|

One of the constituent parts of the ALPAC report was a study comparing different levels of human translation with machine translation output, using human subjects as judges. The human judges were specially trained for the purpose. The evaluation study compared … Read More