2021/06/30 | Media | Artificial Intelligence

AI improves speech understanding of hearing aid users

Hearing aid or hearing implant users find it challenging to understand their conversational partner in noisy environments because current audio processors still insufficiently allow them to focus on specific sound sources. In a feasibility study, researchers from the Hearing Research Laboratory (HRL) at the University of Bern and the Inselspital are now suggesting that artificial intelligence could solve this problem.

Hearing aids and implants typically use two microphones in close proximity to help users filter out background noises and direct attention to a conversation. Adding a third virtual microphone, estimated by a neural network as placed on the forehead, can significantly improve this effect for hearing aid and cochlear implant users in “cocktail party” scenarios. (https://doi.org/10.1016/j.heares.2021.108294 © ARTORG Center)

Hearing aids or hearing implants are currently not very good at selectively filtering specific speech from many sound sources for the wearer – a natural ability of the human brain and sense of hearing known in audiology as the “cocktail party effect”. Accordingly, it is difficult for hearing aid users to follow a conversation in a noisy environment. Researchers at the Hearing Research Laboratory of the ARTORG Center, University of Bern, and Inselspital have now devised an unusual approach to improve hearing aids in this respect: virtual auxiliary microphones whose signals are calculated by artificial intelligence. 

Like an invisible microphone on the forehead 

The more microphones are available and the more widely they are distributed, the better a hearing aid can focus on sound from a particular direction. Most hearing aids have two microphones close together due to lack of space. In the first part of the study, the Hearing Research Laboratory (HRL) determined that the optimal microphone location (for better focusing) is in the middle of the forehead – though this is a very impractical location. “We wanted to get around this problem by adding a virtual microphone to the audio processor using artificial intelligence,” said Tim Fischer, a postdoctoral researcher at HRL, explaining this unconventional approach. 

Cocktail party data, a neural network, a subject test 

For the study setup, ARTORG Center engineers used the “Bern Cocktail Party Dataset”, a collection of a variety of noise scenarios with multiple sound sources from multi-microphone recordings of hearing aid or cochlear implant users. Using 65 hours of audio recordings (more than 78,000 audio files), they trained a neural network to refine a commonly used directionality algorithm (beamformer). For improved speech understanding the deep learning approach calculated additional virtual microphone signals from the audio data mixture. 20 subjects tested the AI-enhanced hearing in a subjective hearing test accompanied by objective measurements. Particularly in cocktail party settings, the virtually sampled microphone signals significantly improved the speech quality. Hearing aid and cochlear implant users could therefore benefit from the presented approach, especially in noisy environments.

"I think that artificial intelligence represents an important contribution to the next generation of hearing prostheses, as it has great potential for improving speech understanding, especially in difficult listening situations," says Marco Caversaccio, Chief Physician and ENT Department Head. As auditory assistive technologies and implants are a major focus of research at the Inselspital, important data-based foundations are being laid here for further development that should bring the natural hearing experience closer. The novel approaches will directly benefit patients within the framework of translational studies. 


Although within this study the virtually added microphones significantly improved the quality of speech understanding with hearing aids, further studies still need to overcome some technical hurdles before the methodology can be used in hearing aids or cochlear implant audio processors. This includes, for example, a stable functioning directional understanding even in reverberant environments.