OFAI

OFAI-TR-2007-11 (394 kB PDF file)

Phonetic Segmentation of the GEMEP-Corpus: Applying Forced Alignment on Emotional Speech

Hannes Pirker

This report documents the application of MFCC-based Hidden Markov Models to the phonetic segmentation of emotional speech. The speech samples were taken from the Geneva Multimodal Emotion Portrayals (GEMEP) corpus. This multimodal corpus of acted emotional utterances provides data with highly uniform and controlled lexical content and thus offers a promising basis for further systematic studies, especially of the acoustic properties of emotional speech and of the temporal relationship between speech, gestures and facial expressions. The phoneme-level segmentation described in this report offers a solid basis for further investigations of the temporal properties of multimodal emotional data. The report describes the technical layout of the automatic alignment procedure, notes peculiarities of the data, and evaluates the quality of the resulting segmentation.

Keywords: Automatic Alignment, Phonetic Segmentation, Emotional Speech

Citation: Pirker H.: Phonetic Segmentation of the GEMEP-Corpus: Applying Forced Alignment on Emotional Speech. Technical Report, Österreichisches Forschungsinstitut für Artificial Intelligence, Wien, TR-2007-11, 2007