Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?

This article presents an overview of the technological components used in the process of audio description, and suggests a new scenario in which speech recognition, machine translation, and text-to-speech, with the corresponding human revision, could be used to increase audio description provision. The article focuses on a process in which both speaker diarization and speech recognition are used in order to obtain a semi-automatic transcription of the audio description track. The technical process is presented and experimental results are summarized.

Attached documents
37501-141192-1-PB.pdf

Submitted by Valeria Cervetti
19/04/2017
in the project Audiovisual Translation for the Web

last updated 19/04/2017

Original editing language: Italian