This page provides audio examples that illustrate the separation results obtained with different levels of NMFD side information, as well as with our proposed post-processing methods. The intention is to point out differences that are not well captured by the PEASS and BSSEval [1] metrics. The audio item on this page is also used in the paper as an illustrative example of cross-talk reduction.
Test set item: WaveDrum02_56
Mixture
Original mixture signal
Kick drum
Original component signal
Oracle-based separation [Case 0]
Fully informed separation with fixed templates [Case 1a]
Fully informed separation with adaptive templates [Case 1b]
Audio-informed separation [Case 2]
Score-informed separation [Case 3]
Uninformed separation [Case 4]
Temporal cross-talk attenuation [Cascade]
Spectral cross-talk attenuation [Dictionary]
Both methods in succession [Combined]
Snare drum
Original component signal
Oracle-based separation [Case 0]
Fully informed separation with fixed templates [Case 1a]
Fully informed separation with adaptive templates [Case 1b]
Audio-informed separation [Case 2]
Score-informed separation [Case 3]
Uninformed separation [Case 4]
Temporal cross-talk attenuation [Cascade]
Spectral cross-talk attenuation [Dictionary]
Both methods in succession [Combined]
Hi-hat
Original component signal
Oracle-based separation [Case 0]
Fully informed separation with fixed templates [Case 1a]
Fully informed separation with adaptive templates [Case 1b]
[1] Valentin Emiya, Emmanuel Vincent, Niklas Harlander, and Volker Hohmann. Subjective and Objective Quality Assessment of Audio Source Separation. IEEE Transactions on Audio, Speech, and Language Processing, 19(7): 2046–2057, 2011.
@article{EmiyaVHH11_QualityEvalSourceSep_IEEE-TASLP,
author = {Valentin Emiya and Emmanuel Vincent and Niklas Harlander and Volker Hohmann},
title = {{S}ubjective and {O}bjective {Q}uality {A}ssessment of {A}udio {S}ource {S}eparation},
journal = {{IEEE} Transactions on Audio, Speech, and Language Processing},
volume = {19},
number = {7},
year = {2011},
pages = {2046--2057},
}