Audiovisual Speech Inversion
We focus on recovering aspects of the vocal tract's geometry and dynamics from the speech signal, a problem referred to as speech inversion.
We have developed highly adaptive multimodal fusion rules based on uncertainty compensation, which are compatible with both synchronous and asynchronous multimodal interaction architectures.
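As an illustration of uncertainty-compensated fusion (a minimal sketch, not the group's actual fusion rule): each modality contributes an estimate weighted by the inverse of its uncertainty, so noisier streams are automatically down-weighted. The function name and the example variances are assumptions for illustration only.

```python
import numpy as np

def fuse_streams(means, variances):
    """Fuse per-modality estimates by inverse-variance weighting:
    streams with lower uncertainty contribute more to the result."""
    means = np.asarray(means, dtype=float)
    variances = np.asarray(variances, dtype=float)
    weights = 1.0 / variances
    weights /= weights.sum(axis=0)               # normalize across modalities
    fused_mean = (weights * means).sum(axis=0)
    fused_var = 1.0 / (1.0 / variances).sum(axis=0)
    return fused_mean, fused_var

# Example: a confident audio estimate and a noisy visual estimate
audio_mean, audio_var = 2.0, 0.1
visual_mean, visual_var = 4.0, 0.9
m, v = fuse_streams([audio_mean, visual_mean], [audio_var, visual_var])
# The fused estimate stays close to the low-variance audio stream
```

The same weighting extends naturally to stream exponents in asynchronous (multi-stream HMM) architectures, where per-stream reliabilities modulate the contribution of each modality at each time step.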
Detection of perceptually important video events is formulated on the basis of saliency models for the audio, visual and textual information conveyed in a video stream.
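A toy sketch of saliency-based event detection under assumed weights (the weights, threshold, and function name are illustrative, not the published model): per-frame saliency curves from each modality are linearly fused and thresholded to flag perceptually important frames.

```python
import numpy as np

def fuse_saliency(audio_s, visual_s, text_s, w=(0.4, 0.4, 0.2)):
    """Linearly combine per-frame saliency curves from three modalities."""
    curves = np.stack([audio_s, visual_s, text_s])
    return np.average(curves, axis=0, weights=w)

# Three frames: only the middle frame is salient in all modalities
audio = np.array([0.1, 0.9, 0.2])
visual = np.array([0.2, 0.8, 0.1])
text = np.array([0.0, 1.0, 0.0])
fused = fuse_saliency(audio, visual, text)
events = fused > 0.5          # flag perceptually important frames
```

In practice the per-modality curves would come from dedicated audio, visual, and text saliency models, and the fusion weights could themselves be adapted over time.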
We have developed a visual processing framework for hand and head tracking and for feature extraction from sign language (SL) and gesture videos.
We are working on microphone array processing and distant speech recognition, aiming to create hands-free, voice-enabled interfaces for home automation control.
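A classic building block for such front-ends is delay-and-sum beamforming, sketched below under simplifying assumptions (integer-sample delays, a two-microphone array; the function name is illustrative): each channel is time-aligned toward the talker and the channels are averaged, reinforcing the target speech relative to diffuse noise.

```python
import numpy as np

def delay_and_sum(signals, delays, fs):
    """Delay-and-sum beamformer (integer-sample approximation).
    signals: (n_mics, n_samples) array; delays: per-mic delays in seconds."""
    n_mics = signals.shape[0]
    out = np.zeros(signals.shape[1])
    for sig, d in zip(signals, delays):
        shift = int(round(d * fs))
        out += np.roll(sig, -shift)       # advance each channel into alignment
    return out / n_mics

# Example: the second mic receives the same waveform 3 samples later
fs = 16000
t = np.arange(64) / fs
ref = np.sin(2 * np.pi * 440 * t)
mic1 = ref
mic2 = np.roll(ref, 3)                    # simulated 3-sample propagation delay
out = delay_and_sum(np.stack([mic1, mic2]), [0.0, 3 / fs], fs)
# After alignment, the output matches the reference waveform
```

Real systems estimate the delays from the data (e.g. via cross-correlation) and use fractional-delay filtering rather than integer sample shifts.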