Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities? [OER]

This article presents an overview of the technological components used in audio description and proposes a new scenario in which speech recognition, machine translation, and text-to-speech, together with the corresponding human revision, could be used to increase audio description provision. It focuses on a process that combines speaker diarization and speech recognition to obtain a semi-automatic transcription of the audio description track. The technical process is presented and the experimental results are summarized.

OER type: Metadata and document(s)

Source repository

Attached documents: 37501-141192-1-PB.pdf

Submitted by Valeria Cervetti on 19/04/2017 in the project Audiovisual Translation for the Web