DTMでの音色検索を対象とした機械学習アルゴリズムの提案(for FIT2016)

34
DTM でででででででででででで ででででででででででででで 2016 09 09 でででででででででで でで 2 で でで で ([email protected]) でで ででで Propose of Machine Learning Algorithms for the Searching Timbre Information on DTM Copyright © 2016 Hajime Saito. All rights reserved.

Transcript of DTMでの音色検索を対象とした機械学習アルゴリズムの提案(for FIT2016)

PowerPoint

DTM20160909 2([email protected])

Propose of Machine Learning Algorithms for the Searching Timbre Information on DTMCopyright 2016 Hajime Saito. All rights reserved.

0

2016/09/09FIT2016 E-25DTM1

Obamichikolab

1

2016/09/09FIT2016 E-25DTM2

PC(DTMDeskTop Music)

(CGMConsumer Generated Media)[1][1], : , , vol. 21, no. 1, pp. 2936, Aug. 2014.

Obamichikolab

(CGMConsumer Generated Media)[1]PC(DTMDeskTop Music)[1], : , , vol. 21, no. 1, pp. 2936, Aug. 2014.

2

2016/09/09FIT2016 E-25DTM3()

Obamichikolab

3

2016/09/09FIT2016 E-25DTM4

TASS[2]

(Synth1,)(,)(,)

Timbre Adjustment Support System[2], , DTM(DeskTop Music), presented at the 77, 2015.DTM

Obamichikolab

TASS(Timbre Adjustment Support System)()4

2016/09/09FIT2016 E-25DTM5

Obamichikolab

TASShttps://youtu.be/IIfFO_s8gC05

2016/09/09FIT2016 E-25DTM6

TASS

AB

Obamichikolab

6

2016/09/09FIT2016 E-25DTM7

DTM

DTM(DeskTop Music) [3][3], , DTM(DeskTop Music), MUS, vol. 2016-MUS-110, no. 15, pp. 16, Feb. 2016.

Obamichikolab

DTM(DeskTop Music) [3][3], , DTM(DeskTop Music), MUS, vol. 2016-MUS-110, no. 15, pp. 16, Feb. 2016.

7

2016/09/09FIT2016 E-25DTM8

TASS

MFCC12

Obamichikolab

HEVNERMFCC8

2016/09/09FIT2016 E-25DTM9HEVNER[4][4]K. HEVNER, experimental studies of the elements of expression in music, American Journal of Psychology, vol. 48, pp. 246268, 1936.

/ Serious / Sacred

C1 / Dark / Sad

C2 / Dreamy / Sentimental

C3 / Calm / Sacred

C4 / Delicate / Light

C5 / Happy / Cheerful

C6 / Dramatic / Sensational

C7 / Majestic / Exalting

C8

Obamichikolab

HEVNER9

2016/09/09FIT2016 E-25DTM10()[5][5], Q, . [Online]. Available: http://abcpedia.acoustics.jp/acoustic_feature_2.pdf. [Accessed: 30-Dec-2015].[6], MFCC - Miyazawas Pukiwiki , Miyazawas Pukiwiki , 29-Mar-2013. [Online]. Available: http://shower.human.waseda.ac.jp/~m-kouki/pukiwiki_public/66.html. [Accessed: 07-Jul-2015].MFCC(Mel Frequency Cepstral Coefficient)[6]

0s ()0h ()

MFCC

Obamichikolab

MFCC10

DTM

DTM

TASSv22016/09/09FIT2016 E-25DTM11

Obamichikolab

DTMDTMTASSv211

2016/09/09FIT2016 E-25DTM12

Obamichikolab

12

2016/09/09FIT2016 E-25DTM13

SVM(SupportVectorMachine)TASSv2

Obamichikolab

13

SVM(SupportVectorMachine)2016/09/09FIT2016 E-25DTM14

2

Obamichikolab

SVM14

2016/09/09FIT2016 E-25DTM15

()x(1,2,3,,8):y(0,1,2,,12):mfccfx:my:mfccCx:C(C1,C2,C3,,C8)Ix:Px:(Cx)

TASS

Obamichikolab

15

2016/09/09FIT2016 E-25DTM16

()

TASS

x(1,2,3,,8):y(0,1,2,,12):mfccfx:my:mfccCx:C(C1,C2,C3,,C8)Ix:Px:(Cx)

Obamichikolab

16

2016/09/09FIT2016 E-25DTM17

()

TASS

x(1,2,3,,8):y(0,1,2,,12):mfccfx:my:mfccCx:C(C1,C2,C3,,C8)Ix:Px:(Cx)

Obamichikolab

17

2016/09/09FIT2016 E-25DTM18

()

TASS

x(1,2,3,,8):y(0,1,2,,12):mfccfx:my:mfccCx:C(C1,C2,C3,,C8)Ix:Px:(Cx)

Obamichikolab

18

2016/09/09FIT2016 E-25DTM19

2345

SVMSVM(SupportVectorMachine)[7][7], , , p. 21.

Obamichikolab

SVM()19

2016/09/09FIT2016 E-25DTM20SVM

idClassmfcc-1mfcc-2mfcc-3mfcc-4mfcc-261-1.5113130810.1441923980.1227015630.386854559-0.0121400652**1.014427781-0.64295423-0.376046747-0.1045032960.0098016063***0.739102602-0.188990891-0.642014027-0.249793604-0.01166570525*1.1828038690.8783208730.096936785-0.0210610910.028196063

C1- * *

MFCC

MFCC

Obamichikolab

SVM20

3825SVMLeave-One-Out2016/09/09FIT2016 E-25DTM21

11

24

()

123n(=25)12n(=25)

Obamichikolab

3825SVMLeave-One-Out

21

2016/09/09FIT2016 E-25DTM22

(%)ABCSVM249.566.562.059.3354.560.045.053.2450.058.041.549.8550.559.539.049.72050.551.540.547.584.241.244.455.6

Obamichikolab

22

(%)ABCSVM249.566.562.059.3354.560.045.053.2450.058.041.549.8550.559.539.049.72050.551.540.547.584.241.244.455.6

2016/09/09FIT2016 E-25DTM23

SVM50

()SVM

Obamichikolab

SVM5023

(%)ABCSVM249.566.562.059.3354.560.045.053.2450.058.041.549.8550.559.539.049.72050.551.540.547.584.241.244.455.6

2016/09/09FIT2016 E-25DTM24

SVM

Obamichikolab

SVM24

2016/09/09FIT2016 E-25DTM25

x(1,2,3,,8):C(C1,C2,C3,,C8)fx:m:mfccCx:Ix:n:Px:(Cx)Px:(Cx)

TASS

(2)

Obamichikolab

25

2016/09/09FIT2016 E-25DTM26

TASS

(2)1.

C1C3C4C6

x(1,2,3,,8):C(C1,C2,C3,,C8)fx:m:mfccCx:Ix:n:Px:(Cx)Px:(Cx)

Obamichikolab

26

2016/09/09FIT2016 E-25DTM27

TASS

(2)

x(1,2,3,,8):C(C1,C2,C3,,C8)fx:m:mfccCx:Ix:n:Px:(Cx)Px:(Cx)

Obamichikolab

.27

2016/09/09FIT2016 E-25DTM28

(%)ABCSVM2050.551.540.547.6(1)84.241.244.455.6(2)70.656.375.067.9

(2)

Obamichikolab

SVM()(2)28

2016/09/09FIT2016 E-25DTM29

Obamichikolab

29

2016/09/09FIT2016 E-25DTM30TASSTASSv2

Obamichikolab

TASSTASS V230

2016/09/09FIT2016 E-25DTM31

TASSv2TASSv2

Obamichikolab

TASS V231

[1], : , , vol. 21, no. 1, pp. 2936, Aug. 2014.[2], , DTM(DeskTop Music), presented at the 77, 2015.[3] and , DTM(DeskTop Music), MUS, vol. 2016-MUS-110, no. 15, pp. 16, Feb. 2016.[4]K. HEVNER, experimental studies of the elements of expression in music, American Journal of Psychology, vol. 48, pp. 246268, 1936.[5], Q, . [Online]. Available: http://abcpedia.acoustics.jp/acoustic_feature_2.pdf. [Accessed: 30-Dec-2015].[6], MFCC - Miyazawas Pukiwiki , Miyazawas Pukiwiki , 29-Mar-2013. [Online]. Available: http://shower.human.waseda.ac.jp/~m-kouki/pukiwiki_public/66.html. [Accessed: 07-Jul-2015].[7], , , p. 0-21.

2016/09/09FIT2016 E-25DTM32

Obamichikolab

DTM(2)2016/09/09FIT2016 E-25DTM33

Obamichikolab