Journal on audio speech and music processing文章索引

Diffraction perception in L-shaped rooms using virtual reality [0.03%] 基于虚拟现实的L型房间中声波衍射的听觉感知研究

Joshua Mannall,Annika Neidhardt,Paul Calamia et al. Joshua Mannall et al.

Outside of shoebox rooms, acoustic diffraction phenomena are present and can influence important aspects of auditory perception, such as localisation. A simple extension of a shoebox room is an L-shaped room as it introduces a single diffra...

EURASIP journal on audio, speech, and music processing. 2026;2026(1):7. DOI:10.1186/s13636-025-00433-2 2026

Robust and early howling detection based on a sparsity measure [0.03%] 基于稀疏度量的鲁棒性和早期啸叫检测

Mina Mounir,Giuliano Bernardi,Toon van Waterschoot Mina Mounir

Despite recent advances in audio technology, acoustic feedback remains a problem encountered in many sound reinforcement applications, ranging from public address systems to hearing aids. Acoustic feedback occurs due to the acoustic couplin...

EURASIP journal on audio, speech, and music processing. 2025;2025(1):14. DOI:10.1186/s13636-025-00399-1 2025

Singing to speech conversion with generative flow [0.03%] 生成流的唱歌到说话的转换

Jiawen Huang,Emmanouil Benetos Jiawen Huang

This paper introduces singing to speech conversion (S2S), a cross-domain voice conversion task, and presents the first deep learning-based S2S system. S2S aims to transform singing into speech while retaining the phonetic information, reduc...

EURASIP journal on audio, speech, and music processing. 2025;2025(1):12. DOI:10.1186/s13636-025-00400-x 2025

Steered Response Power for Sound Source Localization: a tutorial review [0.03%] 定向响应功率声音定位方法综述tutorial论文

Eric Grinstein,Elisa Tengan,Bilgesu Çakmak et al. Eric Grinstein et al.

In the last three decades, the Steered Response Power (SRP) method has been widely used for the task of Sound Source Localization (SSL), due to its satisfactory localization performance on moderately reverberant and noisy scenarios. Many wo...

Review EURASIP journal on audio, speech, and music processing. 2024;2024(1):59. DOI:10.1186/s13636-024-00377-z 2024

A framework for the acoustic simulation of passing vehicles using variable length delay lines [0.03%] 基于可变长度延时线的通行车辆声学仿真框架

Stefano Damiano,Luca Bondi,Andre Guntoro et al. Stefano Damiano et al.

The sound produced by vehicles driving on roadways constitutes one of the dominant noise sources in urban areas. The impact of traffic noise on human activities and the related investigation on modeling, assessment, and abatement strategies...

EURASIP journal on audio, speech, and music processing. 2024;2024(1):49. DOI:10.1186/s13636-024-00372-4 2024

Compression of room impulse responses for compact storage and fast low-latency convolution [0.03%] 压缩房间脉冲响应以实现紧凑存储和快速低延迟卷积

Martin Jälmby,Filip Elvander,Toon van Waterschoot Martin Jälmby

Room impulse responses (RIRs) are used in several applications, such as augmented reality and virtual reality. These applications require a large number of RIRs to be convolved with audio, under strict latency constraints. In this paper, we...

EURASIP journal on audio, speech, and music processing. 2024;2024(1):45. DOI:10.1186/s13636-024-00363-5 2024

Explicit-memory multiresolution adaptive framework for speech and music separation [0.03%] 一种用于语音和音乐分离的显式记忆多分辨率自适应框架

Ashwin Bellur,Karan Thakkar,Mounya Elhilali Ashwin Bellur

The human auditory system employs a number of principles to facilitate the selection of perceptually separated streams from a complex sound mixture. The brain leverages multi-scale redundant representations of the input and uses memory (or ...

EURASIP journal on audio, speech, and music processing. 2023;2023(1):20. DOI:10.1186/s13636-023-00286-7 2023

MYRiAD: a multi-array room acoustic database [0.03%] MYRiAD：多声学阵列房间声学数据库

Thomas Dietzen,Randall Ali,Maja Taseska et al. Thomas Dietzen et al.

In the development of acoustic signal processing algorithms, their evaluation in various acoustic environments is of utmost importance. In order to advance evaluation in realistic and reproducible scenarios, several high-quality acoustic da...

EURASIP journal on audio, speech, and music processing. 2023;2023(1):17. DOI:10.1186/s13636-023-00284-9 2023

Paralinguistic singing attribute recognition using supervised machine learning for describing the classical tenor solo singing voice in vocal pedagogy [0.03%] 基于监督机器学习的声乐教学中经典男高音独唱声音属性识别研究

Yanze Xu,Weiqing Wang,Huahua Cui et al. Yanze Xu et al.

Humans can recognize someone's identity through their voice and describe the timbral phenomena of voices. Likewise, the singing voice also has timbral phenomena. In vocal pedagogy, vocal teachers listen and then describe the timbral phenome...

EURASIP journal on audio, speech, and music processing. 2022;2022(1):8. DOI:10.1186/s13636-022-00240-z 2022

On the selection of the number of beamformers in beamforming-based binaural reproduction [0.03%] 基于波束成形的双耳渲染中选择波束形成器的数量研究

Itay Ifergan,Boaz Rafaely Itay Ifergan

In recent years, spatial audio reproduction has been widely researched with many studies focusing on headphone-based spatial reproduction. A popular format for spatial audio is higher order Ambisonics (HOA), where a spherical microphone arr...

EURASIP journal on audio, speech, and music processing. 2022;2022(1):6. DOI:10.1186/s13636-022-00238-7 2022