thesis topics — speech & audio
Speech Recognition
Speech Therapy
Audio Detection & Classification
Music
Automatic monitoring of communication skills
Automatic tool to monitor communication style
Your browser does not support the video tag.
Based on process communication model (PCM)
In collaboration with Communicate 2 Connect
Task: apply speech recognition technology for PCM:
non-verbal information (emotion, prosody)
key-words, key-phrases → frequency counts
spontaneous, noisy audio
Automatic monitoring of communication skills
Towards a self-learning speech recognition system
Current machine learning techniques learn from labelled data
Can only learn relations prepared by human annotators → costly
Example: speech recognition
Works well for standard, prepared speech
Fails on new words, regiolects, dialects, hesitations, borken-off words, ...
Your browser does not support the video tag.
Aim: self-learning speech recognizer; weakly supervised
Towards a self-learning speech recognition system
Automatic classification & evaluation of speech disorders
Speech therapy for patients with severe speech impairment: laryngectomy, dysarthria
Tool (website ) to automate voice training (in collaboration with speech therapists)
Web-site: recording (audio) + manual annotations (annotations) + automatic feedback
Machine Learning for Vocal Activity Detection
Detect when somebody sings ↔ instruments only
Non-trivial problem
→ deep learning
Developing an automatic DJ system