Enter title, abstract and co-authors, and upload a lastname.txt file (can be empty or contain additional information regarding the submission).Choose “Create new submission” in the Author Console.After logging in, complete the following steps to submit the results: Please use Microsoft Conference Management Toolkit for submitting the results. The winners of this challenge will be selected based on the average Mean Opinion Score (MOS) achieved across all different single talk and double talk scenarios. We also open source an online subjective test framework and provide an online objective metric service for researchers to quickly test their results. These datasets consist of recordings from more than 5,000 real audio devices and human speakers in real environments, as well as a synthetic dataset.
![acoustic echo cancellation for voip acoustic echo cancellation for voip](https://www.manualsdir.com/manuals/300888/45/asus-xonar-dgx-page45.png)
In this challenge, we open source two large datasets to train AEC models under both single talk and double talk scenarios.
![acoustic echo cancellation for voip acoustic echo cancellation for voip](https://engineering.linecorp.com/wp-content/uploads/2015/07/voip-05.png)
Also, most of the conventional objective metrics such as echo return loss enhancement (ERLE) and perceptual evaluation of speech quality (PESQ) do not correlate well with subjective speech quality tests in the presence of background noise and reverberation found in realistic environments. However, the AEC performance often degrades significantly on real recordings.
![acoustic echo cancellation for voip acoustic echo cancellation for voip](https://www.soundandcommunications.com/wp-content/uploads/2015/10/Audio-waves.jpg)
Many recent AEC studies report good performance on synthetic datasets where the training and testing data come from the same underlying distribution. The INTERSPEECH 2021 Acoustic Echo Cancellation Challenge is intended to stimulate research in the area of acoustic echo cancellation (AEC), which is an important part of speech enhancement and still a top issue in audio communication and conferencing systems.