Accuracy rate in live subtitling – the NER model

view resource

For some time now, subtitling companies have been providing broadcasters with regular data on the accuracy of their live subtitles. In some cases this is actually a contractual obligation, as companies are required to obtain a given accuracy rate in their subtitles. However, given that accuracy calculations vary greatly between countries and even companies, the question arises of whether we are effectively comparing incomparable data. The aim of this paper is, first of all, to review different accuracy models used around the world. Then, using the model put forward in Romero-Fresco (2011) as a starting point, a new model is presented to assess the accuracy of live subtitles in different countries and in different languages by analyzing the extent to which errors affect the coherence of the subtitled text or modify its content. Real-life examples in English, Spanish, Italian and German obtained from a corpus of 35,000 words are included. The focus is placed on respoken subtitles, which are nowadays the most common type of live subtitles. However, the model is also applicable to automatic subtitles, since, given the rapid progress of speech recognition technology, they are likely to be introduced in the near future. The model is currently being used by regulators, broadcasters, companies and training institutions in Spain, France, Italy, Switzerland, Germany, Belgium and Australia, among other countries.

Type of material
Terms of use
Target audience
Subject areas
Tags
Languages
Media formats
Accessibility features
OER type
Metadata and document(s)
Attached documents
NER-English.pdf       

Submitted by Valeria Cervetti
04/11/2016
in the project Audiovisual Translation for the Web

last updated 04/11/2016

Original editing language: Italiano
Evaluations
No evaluation

Please log in to add evaluation.

Comments

No comments yet.

Please log in to leave a comment.