Journal on audio speech and music processing: article index

Musical note onset detection based on a spectral sparsity measure

Mina Mounir, Peter Karsmakers, Toon van Waterschoot

If music is the language of the universe, musical note onsets may be the syllables for this language. Not only do note onsets define the temporal pattern of a musical piece, but their time-frequency characteristics also contain rich informa...

EURASIP journal on audio, speech, and music processing. 2021;2021(1):30. DOI:10.1186/s13636-021-00214-7

End-to-end speech emotion recognition using a novel context-stacking dilated convolution neural network

Duowei Tang, Peter Kuppens, Luc Geurts et al.

Amongst the various characteristics of a speech signal, the expression of emotion is one of the characteristics that exhibits the slowest temporal dynamics. Hence, a performant speech emotion recognition (SER) system requires a predictive m...

EURASIP journal on audio, speech, and music processing. 2021;2021(1):18. DOI:10.1186/s13636-021-00208-5

Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices

Rajat Hebbar, Pavlos Papadopoulos, Ramon Reyes et al.

Over the recent years, machine learning techniques have been employed to produce state-of-the-art results in several audio related tasks. The success of these approaches has been largely due to access to large amounts of open-source dataset...

EURASIP journal on audio, speech, and music processing. 2021;2021(1):7. DOI:10.1186/s13636-020-00194-0

Time-frequency scattering accurately models auditory similarities between instrumental playing techniques

Vincent Lostanlen, Christian El-Hajj, Mathias Rossignol et al.

Instrumental playing techniques such as vibratos, glissandos, and trills often denote musical expressivity, both in classical and folk contexts. However, most existing approaches to music similarity retrieval fail to describe timbre beyond t...

EURASIP journal on audio, speech, and music processing. 2021;2021(1):3. DOI:10.1186/s13636-020-00187-z

Articulation constrained learning with application to speech emotion recognition

Mohit Shah, Ming Tu, Visar Berisha et al.

Speech emotion recognition methods combining articulatory information with acoustic features have been previously shown to improve recognition performance. Collection of articulatory data on a large scale may not be feasible in many scenari...

EURASIP journal on audio, speech, and music processing. 2019;2019(1):14. DOI:10.1186/s13636-019-0157-9

Biomimetic spectro-temporal features for music instrument recognition in isolated notes and solo phrases

Kailash Patil, Mounya Elhilali

The identity of musical instruments is reflected in the acoustic attributes of musical notes played with them. Recently, it has been argued that these characteristics of musical identity (or timbre) can be best captured through an analysis ...

EURASIP journal on audio, speech, and music processing. 2015:2015:27. DOI:10.1186/s13636-015-0070-9

Biomimetic multi-resolution analysis for robust speaker recognition

Sridhar Krishna Nemala, Dmitry N Zotkin, Ramani Duraiswami et al.

Humans exhibit a remarkable ability to reliably classify sound sources in the environment even in presence of high levels of noise. In contrast, most engineering systems suffer a drastic drop in performance when speech signals are corrupted...

EURASIP journal on audio, speech, and music processing. 2012:2012:22. DOI:10.1186/1687-4722-2012-22