Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?
This article presents an overview of the technological components used in the process of audio description, and suggests a new scenario in which speech recognition, machine translation, and text-to-speech, with the corresponding human revision, could be used to increase audio description provision. The article focuses on a process in which both speaker diarization and speech recognition are used in order to obtain a semi-automatic transcription of the audio description track. The technical process is presented and experimental results are summarized.
Submitted by Valeria Cervetti on 19/04/2017 in the project Audiovisual Translation for the Web; last updated 19/04/2017.