Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?
This article presents an overview of the technological components used in the process of audio description, and suggests a new scenario in which speech recognition, machine translation, and text-to-speech, with the corresponding human revision, could be used to increase audio description provision. The article focuses on a process in which both speaker diarization and speech recognition are used in order to obtain a semi-automatic transcription of the audio description track. The technical process is presented and experimental results are summarized.
This article presents an overview of the technological components used in the process of audio description, and suggests a new scenario in which speech recognition, machine translation, and text-to-speech, with the corresponding human revision, could be used to increase audio description provision. The article focuses on a process in which both speaker diarization and speech recognition are used in order to obtain a semi-automatic transcription of the audio description track. The technical process is presented and experimental results are summarized.
- Tipo di materiale
- Termini di utilizzo
- Destinatari
- Aree tematiche
- Tags
- Lingue
- Formati dei media
- Caratteristiche di accessibilità
- Tipo di OER
- Metadati e documento/i
- Repository di origine
Proposto da
Valeria Cervetti
19/04/2017
nel progetto Traduzione audiovisiva e web
ultima modifica 19/04/2017
- Valutazioni
- Nessuna valutazione
Per favore accedi per aggiungere una valutazione.
Nessun commento.
Per favore accedi per lasciare un commento.