Mina Mounir,Giuliano Bernardi,Toon van Waterschoot
Mina Mounir
Despite recent advances in audio technology, acoustic feedback remains a problem encountered in many sound reinforcement applications, ranging from public address systems to hearing aids. Acoustic feedback occurs due to the acoustic couplin...
Jiawen Huang,Emmanouil Benetos
Jiawen Huang
This paper introduces singing to speech conversion (S2S), a cross-domain voice conversion task, and presents the first deep learning-based S2S system. S2S aims to transform singing into speech while retaining the phonetic information, reduc...
Steered Response Power for Sound Source Localization: a tutorial review [0.03%]
定向响应功率声音定位方法综述tutorial论文
Eric Grinstein,Elisa Tengan,Bilgesu Çakmak et al.
Eric Grinstein et al.
In the last three decades, the Steered Response Power (SRP) method has been widely used for the task of Sound Source Localization (SSL), due to its satisfactory localization performance on moderately reverberant and noisy scenarios. Many wo...
A framework for the acoustic simulation of passing vehicles using variable length delay lines [0.03%]
基于可变长度延时线的通行车辆声学仿真框架
Stefano Damiano,Luca Bondi,Andre Guntoro et al.
Stefano Damiano et al.
The sound produced by vehicles driving on roadways constitutes one of the dominant noise sources in urban areas. The impact of traffic noise on human activities and the related investigation on modeling, assessment, and abatement strategies...
Compression of room impulse responses for compact storage and fast low-latency convolution [0.03%]
压缩房间脉冲响应以实现紧凑存储和快速低延迟卷积
Martin Jälmby,Filip Elvander,Toon van Waterschoot
Martin Jälmby
Room impulse responses (RIRs) are used in several applications, such as augmented reality and virtual reality. These applications require a large number of RIRs to be convolved with audio, under strict latency constraints. In this paper, we...
Explicit-memory multiresolution adaptive framework for speech and music separation [0.03%]
一种用于语音和音乐分离的显式记忆多分辨率自适应框架
Ashwin Bellur,Karan Thakkar,Mounya Elhilali
Ashwin Bellur
The human auditory system employs a number of principles to facilitate the selection of perceptually separated streams from a complex sound mixture. The brain leverages multi-scale redundant representations of the input and uses memory (or ...
Thomas Dietzen,Randall Ali,Maja Taseska et al.
Thomas Dietzen et al.
In the development of acoustic signal processing algorithms, their evaluation in various acoustic environments is of utmost importance. In order to advance evaluation in realistic and reproducible scenarios, several high-quality acoustic da...
Paralinguistic singing attribute recognition using supervised machine learning for describing the classical tenor solo singing voice in vocal pedagogy [0.03%]
基于监督机器学习的声乐教学中经典男高音独唱声音属性识别研究
Yanze Xu,Weiqing Wang,Huahua Cui et al.
Yanze Xu et al.
Humans can recognize someone's identity through their voice and describe the timbral phenomena of voices. Likewise, the singing voice also has timbral phenomena. In vocal pedagogy, vocal teachers listen and then describe the timbral phenome...
On the selection of the number of beamformers in beamforming-based binaural reproduction [0.03%]
基于波束成形的双耳渲染中选择波束形成器的数量研究
Itay Ifergan,Boaz Rafaely
Itay Ifergan
In recent years, spatial audio reproduction has been widely researched with many studies focusing on headphone-based spatial reproduction. A popular format for spatial audio is higher order Ambisonics (HOA), where a spherical microphone arr...
Mina Mounir,Peter Karsmakers,Toon van Waterschoot
Mina Mounir
If music is the language of the universe, musical note onsets may be the syllables for this language. Not only do note onsets define the temporal pattern of a musical piece, but their time-frequency characteristics also contain rich informa...