Hybrid NMF APSIPA2014 invited

Hybrid Multichannel Signal Separation Using Supervised Nonnegative Matrix Factorization

Daichi Kitamura (The University of Tokyo, Japan)

Hiroshi Saruwatari (The University of Tokyo, Japan)

Satoshi Nakamura (Nara Institute of Science and Technology, Japan)

Yu Takahashi (Yamaha Corporation, Japan)

Kazunobu Kondo (Yamaha Corporation, Japan)

Hirokazu Kameoka (The University of Tokyo, Japan)

The University of Tokyo, YAMAHA

Outline
• 1. Research background
• 2. Conventional methods
– Nonnegative matrix factorization
– Supervised nonnegative matrix factorization
– Multichannel NMF
• 3. Proposed method
– SNMF with spectrogram restoration and its hybrid method
• 4. Experiments
– Closed data experiment
– Open data experiment
• 5. Conclusions

Research background
• Signal separation has received much attention.
• Music signal separation based on nonnegative matrix factorization (NMF) is a very active research area.
• Supervised NMF (SNMF) achieves the highest separation performance.
• To improve its performance further, an SNMF-based multichannel signal separation method is required.

Applications
• Automatic music transcription
• 3D audio systems, etc.

Goal: separate the target signal from multichannel signals with high accuracy.


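NMF [Lee, et al., 2001]
• NMF can extract significant spectral patterns.
– The basis matrix holds the spectral patterns that appear frequently in the observed matrix.
• Decomposition: observed matrix (amplitude spectrogram) ≈ basis matrix (spectral patterns) × activation matrix (time-varying gains)
• Sizes: the observed matrix is (number of frequency bins) × (number of time frames), the basis matrix is (number of frequency bins) × (number of bases), and the activation matrix is (number of bases) × (number of time frames).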
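The factorization on this slide can be sketched in a few lines of NumPy. This is a minimal sketch, assuming the KL-divergence cost and the standard multiplicative updates; the names `nmf_kl`, `kl_divergence`, `Y`, `F`, and `G` are illustrative choices, not notation recovered from the slides.

```python
import numpy as np

EPS = 1e-12  # small constant that guards against division by zero


def kl_divergence(Y, L):
    """Generalized KL divergence between nonnegative arrays Y and L."""
    return float(np.sum(Y * np.log((Y + EPS) / (L + EPS)) - Y + L))


def nmf_kl(Y, n_bases, n_iter=200, seed=0):
    """Factorize a nonnegative matrix Y (freq x frames) as F @ G."""
    rng = np.random.default_rng(seed)
    n_freq, n_frames = Y.shape
    F = rng.random((n_freq, n_bases)) + EPS    # basis matrix (spectral patterns)
    G = rng.random((n_bases, n_frames)) + EPS  # activation matrix (time-varying gains)
    for _ in range(n_iter):
        # Multiplicative update of the bases with the activations held fixed.
        R = Y / (F @ G + EPS)
        F *= (R @ G.T) / (G.sum(axis=1)[None, :] + EPS)
        # Multiplicative update of the activations with the bases held fixed.
        R = Y / (F @ G + EPS)
        G *= (F.T @ R) / (F.sum(axis=0)[:, None] + EPS)
    return F, G
```

Calling `F, G = nmf_kl(Y, n_bases=10)` on an amplitude spectrogram returns nonnegative factors whose product approximates `Y`; the number of bases is a free parameter that the user has to choose.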

Supervised NMF [Smaragdis, et al., 2007]
• SNMF: supervised spectral separation method
• Training process: a supervised basis matrix (spectral dictionary) is learned from sample sounds of the target signal.
• Separation process: the mixed signal is decomposed into the target signal and the other signals; the supervised basis matrix is kept fixed and the remaining matrices are optimized.
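The two stages of SNMF (training, then separation with the trained bases fixed) can be sketched as follows. This is an illustrative sketch only, assuming KL-divergence multiplicative updates; the function names `train_bases` and `separate`, and the symbols `F`, `G`, `H`, `U`, are assumptions introduced here rather than the authors' implementation.

```python
import numpy as np

EPS = 1e-12  # small constant that guards against division by zero


def train_bases(Y_train, n_bases, n_iter=200, seed=0):
    """Training process: learn a spectral dictionary F from sample sounds."""
    rng = np.random.default_rng(seed)
    F = rng.random((Y_train.shape[0], n_bases)) + EPS
    G = rng.random((n_bases, Y_train.shape[1])) + EPS
    for _ in range(n_iter):
        R = Y_train / (F @ G + EPS)
        F *= (R @ G.T) / (G.sum(axis=1)[None, :] + EPS)
        R = Y_train / (F @ G + EPS)
        G *= (F.T @ R) / (F.sum(axis=0)[:, None] + EPS)
    return F


def separate(Y_mix, F, n_other, n_iter=200, seed=1):
    """Separation process: fit Y_mix ~ F @ G + H @ U with F kept fixed."""
    rng = np.random.default_rng(seed)
    n_freq, n_frames = Y_mix.shape
    G = rng.random((F.shape[1], n_frames)) + EPS  # activations of the target bases
    H = rng.random((n_freq, n_other)) + EPS       # bases for the other signals
    U = rng.random((n_other, n_frames)) + EPS     # activations of the other bases
    for _ in range(n_iter):
        R = Y_mix / (F @ G + H @ U + EPS)
        G *= (F.T @ R) / (F.sum(axis=0)[:, None] + EPS)
        R = Y_mix / (F @ G + H @ U + EPS)
        H *= (R @ U.T) / (U.sum(axis=1)[None, :] + EPS)
        R = Y_mix / (F @ G + H @ U + EPS)
        U *= (H.T @ R) / (H.sum(axis=0)[:, None] + EPS)
    return F @ G, H @ U  # estimated target and other spectrograms
```

The first return value of `separate` is the target estimate built only from the supervised dictionary; everything the dictionary cannot explain is left to the free bases `H`.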

Problems of SNMF
• SNMF handles only a single-channel signal.
– For a multichannel signal, SNMF cannot use the information between channels.
• When many interference sources exist, the separation performance of SNMF degrades markedly.
(Figure: after separation, residual components remain in the separated signal.)

Multichannel NMF [Sawada, et al., 2013]
• Multichannel NMF
– is a natural extension of NMF to a multichannel signal (microphone array)
– uses spatial information for the clustering of bases to achieve the unsupervised separation task.
• Problems: multichannel NMF depends strongly on the initial values and lacks robustness.


Motivation and strategy
• Sawada's multichannel NMF
– is a unified method that solves the spatial and spectral separations together.
– maximizes a likelihood (equation not recovered; its terms are labeled: observed spectrograms, spatial direction of the target signal, source components of all signals [target, other]).
– In the supervised situation, the target spectral patterns are given.
• Problems
– Too difficult to solve (lacks robustness)
– Computationally inefficient (long computation time)

Motivation and strategy
• Proposed hybrid method
– approximates the unified problem by dividing it into two stages: unsupervised spatial separation (classical D.O.A. estimation) and supervised spectral separation (SNMF-based method).
– The spatial separation should be carried out with classical D.O.A. estimation methods, which are very efficient and stable.
– Divide-and-conquer method

Directional clustering [Araki, et al., 2007]
• Directional clustering
– Unsupervised spatial separation method
– k-means clustering (fast and stable)
• Problems
– Artificial distortion arises owing to the binary masking.
(Figure: each time-frequency grid of the stereo input spectrogram is labeled left (L), center (C), or right (R). The entry-wise product of the spectrogram and a binary mask, which is 1 at the grids of the selected direction and 0 elsewhere, gives the separated signal.)
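A toy version of directional clustering with binary masking is sketched below. It is a simplified illustration, not the method of Araki et al.: it assumes that the direction of each time-frequency grid can be summarized by the inter-channel level ratio alone (a panning angle), and it runs a one-dimensional k-means on that angle. The names `directional_clustering`, `binary_mask`, `XL`, and `XR` are hypothetical.

```python
import numpy as np


def directional_clustering(XL, XR, n_clusters=3, n_iter=50):
    """Label each time-frequency grid of a stereo STFT by its panning angle.

    Returns integer labels with the shape of XL; label 0 is the left-most
    cluster and label n_clusters - 1 the right-most one.
    """
    # Panning angle: 0 means left only, pi/2 means right only.
    angles = np.arctan2(np.abs(XR), np.abs(XL)).ravel()
    centers = np.linspace(0.0, np.pi / 2, n_clusters)  # initial centroids
    for _ in range(n_iter):
        # Assignment step: nearest centroid for every grid.
        labels = np.argmin(np.abs(angles[:, None] - centers[None, :]), axis=1)
        # Update step: each centroid moves to the mean angle of its members.
        for k in range(n_clusters):
            members = angles[labels == k]
            if members.size > 0:
                centers[k] = members.mean()
    labels = np.argmin(np.abs(angles[:, None] - centers[None, :]), axis=1)
    return labels.reshape(XL.shape)


def binary_mask(labels, target_cluster):
    """1 at grids assigned to the target cluster, 0 elsewhere."""
    return (labels == target_cluster).astype(float)
```

With three clusters, `binary_mask(directional_clustering(XL, XR), 1) * XL` keeps only the grids assigned to the center cluster. Every grid that belongs to another cluster is set to zero, which is where the spectral holes in the separated signal come from.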

Proposed method: hybrid separation
• Hybrid separation method
– Input stereo signal → spatial separation method (directional clustering) → SNMF-based separation method (SNMF with spectrogram restoration) → separated signal

SNMF with spectrogram restoration
• The separated cluster contains spectral holes (lost components).
• The proposed SNMF treats these holes as unseen observations.
• The fittest supervised bases (dictionary of the target signal) are extrapolated to fix up the holes.
(Figure: source components of the observed spectrogram distributed over the left, center, and right directions. Directional clustering with binary masking yields the separated cluster, which holds target and interference components; super-resolution-based SNMF with the supervised spectral bases then produces the reconstructed data, including the extrapolated components.)

Decomposition model and cost function
• The divergence is defined at all grids except for the holes by using the binary masking matrix obtained from directional clustering.
• Decomposition model: the separated cluster is approximated by a target term built from the supervised bases (fixed) plus a term for the other components.
• Cost function: the sum of three terms
– the main divergence term, multiplied by the binary index to exclude the holes
– a regularization term, which uses the binary complement of the mask and a weighting parameter
– a penalty term [Kitamura, et al., 2014], which uses the Frobenius norm and a weighting parameter
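Because the equations on this slide did not survive extraction, the following is only one plausible way to write the model and the three-term cost, given so that the structure is visible. Every symbol is an assumption: $\boldsymbol{Y}$ is the separated cluster, $\boldsymbol{F}$ the fixed supervised bases, $\boldsymbol{G}$, $\boldsymbol{H}$, $\boldsymbol{U}$ the variable matrices, $i_{\omega,t}$ the entries of the binary masking matrix, $\bar{i}_{\omega,t}$ their binary complement, $\lambda$ and $\mu$ the weighting parameters, and $\mathcal{D}_{\beta}$ a $\beta$-divergence. The forms of the regularization term (acting on the target model at the holes) and of the penalty term (acting on $\boldsymbol{F}^{\mathsf{T}}\boldsymbol{H}$) are likewise assumed.

```latex
\begin{align}
\boldsymbol{Y} &\approx \boldsymbol{F}\boldsymbol{G} + \boldsymbol{H}\boldsymbol{U}, \\
\mathcal{J} &= \sum_{\omega,t} i_{\omega,t}\,
  \mathcal{D}_{\beta_{\mathrm{NMF}}}\!\Bigl( y_{\omega,t} \,\Big\|\,
  \sum_{k} f_{\omega,k}\, g_{k,t} + \sum_{l} h_{\omega,l}\, u_{l,t} \Bigr)
  + \lambda \sum_{\omega,t} \bar{i}_{\omega,t}\,
  \mathcal{D}_{\beta_{\mathrm{reg}}}\!\Bigl( 0 \,\Big\|\, \sum_{k} f_{\omega,k}\, g_{k,t} \Bigr)
  + \mu \bigl\| \boldsymbol{F}^{\mathsf{T}} \boldsymbol{H} \bigr\|_{\mathrm{Fr}}^{2}
\end{align}
```

Under this reading, the first term fits the model only where observations exist, the second keeps the extrapolated target components at the holes from growing without bound, and the third discourages the free bases from duplicating the supervised ones.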

Generalized divergence: β-divergence
• β-divergence [Eguchi, et al., 2001]
– β = 2: EUC-distance
– β = 1: KL-divergence
– β = 0: IS-divergence
• KL-divergence is the best criterion for signal separation [Kitamura, et al., 2014].
• We used two β-divergences: one for the main cost (β_NMF) and one for the regularization cost.
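For reference, the β-divergence and its three named special cases can be written as a small function. This is a sketch using the commonly used definition; the function name `beta_divergence` and the `eps` guard are choices made here, and with this normalization β = 2 gives half the squared Euclidean distance.

```python
import numpy as np


def beta_divergence(y, x, beta, eps=1e-12):
    """Sum of entry-wise beta-divergences D_beta(y || x) for nonnegative inputs."""
    y = np.asarray(y, dtype=float) + eps
    x = np.asarray(x, dtype=float) + eps
    if beta == 0:  # Itakura-Saito (IS) divergence
        r = y / x
        return float(np.sum(r - np.log(r) - 1.0))
    if beta == 1:  # generalized Kullback-Leibler (KL) divergence
        return float(np.sum(y * np.log(y / x) - y + x))
    # General case; beta = 2 gives half the squared Euclidean distance.
    num = y**beta + (beta - 1.0) * x**beta - beta * y * x ** (beta - 1.0)
    return float(np.sum(num / (beta * (beta - 1.0))))
```

The divergence is zero when the two arguments are equal and positive otherwise; changing β changes how strongly large and small spectral values are weighted in the fit.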

Update rules
• We can obtain the update rules for the optimization of the three variable matrices; the supervised bases stay fixed.
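The update rules themselves were not recovered. As a stand-in, the sketch below shows a deliberately simplified instance of the idea: multiplicative updates for a KL-divergence cost that is evaluated only at the observed grids, with the supervised bases fixed. It omits the regularization and penalty terms, so it is not the authors' update rule; the names `restore_with_fixed_bases`, `masked_kl`, `I`, `F`, `G`, `H`, and `U` are assumptions.

```python
import numpy as np

EPS = 1e-12  # small constant that guards against division by zero


def masked_kl(Y, I, L):
    """KL divergence between Y and L, summed only over grids where I == 1."""
    return float(np.sum(I * (Y * np.log((Y + EPS) / (L + EPS)) - Y + L)))


def restore_with_fixed_bases(Y, I, F, n_other, n_iter=100, seed=0):
    """Fit Y ~ F @ G + H @ U on observed grids only; F is never updated."""
    rng = np.random.default_rng(seed)
    n_freq, n_frames = Y.shape
    G = rng.random((F.shape[1], n_frames)) + EPS  # activations of supervised bases
    H = rng.random((n_freq, n_other)) + EPS       # free bases for other components
    U = rng.random((n_other, n_frames)) + EPS     # activations of the free bases
    for _ in range(n_iter):
        # The mask I zeroes the contribution of the holes in every update.
        R = I * Y / (F @ G + H @ U + EPS)
        G *= (F.T @ R) / (F.T @ I + EPS)
        R = I * Y / (F @ G + H @ U + EPS)
        H *= (R @ U.T) / (I @ U.T + EPS)
        R = I * Y / (F @ G + H @ U + EPS)
        U *= (H.T @ R) / (H.T @ I + EPS)
    return G, H, U
```

After fitting, `F @ G` is defined at every grid, including the holes, which is how the supervised bases extrapolate the lost target components. Without a regularization term, the activations of a frame whose grids are all masked are left undetermined.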


Experimental condition
• The mixed signal includes four melodies (sources).
• Three compositions of instruments
– We evaluated the average score of 36 patterns.
• Supervision signal: 24 notes that cover all the notes in the target melody
(Figure: source layout with positions 1, 2, and 3 across the left, center, and right directions, with the target source marked.)

Dataset   Melody 1   Melody 2   Midrange      Bass
No. 1     Oboe       Flute      Piano         Trombone
No. 2     Trumpet    Violin     Harpsichord   Fagotto
No. 3     Horn       Clarinet   Piano         Cello

Experimental result: closed data
• Signal-to-distortion ratio (SDR)
– total quality of the separation, which includes the degree of separation and the absence of artificial distortion.
(Figure: SDR, on an axis from 0 to 14, versus β_NMF from 0 to 4, with KL-divergence and EUC-distance marked, for conventional SNMF (single-channel SNMF), supervised multichannel NMF [Sawada], and the proposed hybrid method.)
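To make the measure concrete, the sketch below computes a simplified SDR as the energy ratio between a reference signal and the estimation error. This is an assumption-laden simplification: evaluation toolkits usually first decompose the estimate into target, interference, and artifact parts, and the slides do not say which implementation was used. The name `simple_sdr` is hypothetical.

```python
import numpy as np


def simple_sdr(reference, estimate, eps=1e-12):
    """Energy ratio between the reference and the estimation error, in dB."""
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    error = reference - estimate  # everything in the estimate that is not the reference
    return float(10.0 * np.log10((np.sum(reference**2) + eps) / (np.sum(error**2) + eps)))
```

A higher value means that the separated signal is closer to the true target; both remaining interference and masking distortion lower the score because both end up in the error term.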

SNMF with spectrogram restoration
• SNMF with spectrogram restoration has two tasks: source separation and basis extrapolation.
• The optimal divergence for source separation is KL-divergence (β = 1).
• In contrast, a divergence with a higher β value is suitable for the basis extrapolation.

Trade-off: separation and restoration
• The optimal divergence for SNMF with spectrogram restoration and its hybrid method is determined by the trade-off between the separation and restoration abilities.
(Figure: two spectra plotted over frequency from 0 to 5 kHz, one labeled "Sparseness: strong" and the other "Sparseness: weak"; a diagram on an axis from 0 to 4 showing the separation ability, the restoration ability, and the total performance of the hybrid method.)

Experimental condition
• Open data experiment
– used different tone generators for the training and test signals
• Supervision signal: 24 notes that cover all the notes in the target melody, provided by tone generator A
• Test signal: provided by tone generator B (more realistic sound) + background noise (SNR = 10 dB)
(Figure: source layout with positions 1, 2, and 3 across the left, center, and right directions, with the target source marked.)

Experimental result: open data
• Signal-to-distortion ratio (SDR)
– total quality of the separation, which includes the degree of separation and the absence of artificial distortion.
(Figure: SDR, on an axis from -4 to 10, versus β_NMF from 0 to 4, with KL-divergence and EUC-distance marked, for conventional SNMF (single-channel SNMF), supervised multichannel NMF [Sawada], and the proposed hybrid method.)

Conclusions
• We proposed a hybrid multichannel signal separation method combining directional clustering and SNMF with spectrogram restoration.
• There is a trade-off between the separation and restoration abilities.

Thank you for your attention!

Demonstration is available!
