DTMでの音色検索を対象とした機械学習アルゴリズムの提案(for FIT2016)
-
Upload
hajime-saito -
Category
Technology
-
view
147 -
download
1
Transcript of DTMでの音色検索を対象とした機械学習アルゴリズムの提案(for FIT2016)
PowerPoint
DTM20160909 2([email protected])
Propose of Machine Learning Algorithms for the Searching Timbre Information on DTMCopyright 2016 Hajime Saito. All rights reserved.
0
2016/09/09FIT2016 E-25DTM1
Obamichikolab
1
2016/09/09FIT2016 E-25DTM2
PC(DTMDeskTop Music)
(CGMConsumer Generated Media)[1][1], : , , vol. 21, no. 1, pp. 2936, Aug. 2014.
Obamichikolab
(CGMConsumer Generated Media)[1]PC(DTMDeskTop Music)[1], : , , vol. 21, no. 1, pp. 2936, Aug. 2014.
2
2016/09/09FIT2016 E-25DTM3()
Obamichikolab
3
2016/09/09FIT2016 E-25DTM4
TASS[2]
(Synth1,)(,)(,)
Timbre Adjustment Support System[2], , DTM(DeskTop Music), presented at the 77, 2015.DTM
Obamichikolab
TASS(Timbre Adjustment Support System)()4
2016/09/09FIT2016 E-25DTM5
Obamichikolab
TASShttps://youtu.be/IIfFO_s8gC05
2016/09/09FIT2016 E-25DTM6
TASS
AB
Obamichikolab
6
2016/09/09FIT2016 E-25DTM7
DTM
DTM(DeskTop Music) [3][3], , DTM(DeskTop Music), MUS, vol. 2016-MUS-110, no. 15, pp. 16, Feb. 2016.
Obamichikolab
DTM(DeskTop Music) [3][3], , DTM(DeskTop Music), MUS, vol. 2016-MUS-110, no. 15, pp. 16, Feb. 2016.
7
2016/09/09FIT2016 E-25DTM8
TASS
MFCC12
Obamichikolab
HEVNERMFCC8
2016/09/09FIT2016 E-25DTM9HEVNER[4][4]K. HEVNER, experimental studies of the elements of expression in music, American Journal of Psychology, vol. 48, pp. 246268, 1936.
/ Serious / Sacred
C1 / Dark / Sad
C2 / Dreamy / Sentimental
C3 / Calm / Sacred
C4 / Delicate / Light
C5 / Happy / Cheerful
C6 / Dramatic / Sensational
C7 / Majestic / Exalting
C8
Obamichikolab
HEVNER9
2016/09/09FIT2016 E-25DTM10()[5][5], Q, . [Online]. Available: http://abcpedia.acoustics.jp/acoustic_feature_2.pdf. [Accessed: 30-Dec-2015].[6], MFCC - Miyazawas Pukiwiki , Miyazawas Pukiwiki , 29-Mar-2013. [Online]. Available: http://shower.human.waseda.ac.jp/~m-kouki/pukiwiki_public/66.html. [Accessed: 07-Jul-2015].MFCC(Mel Frequency Cepstral Coefficient)[6]
0s ()0h ()
MFCC
Obamichikolab
MFCC10
DTM
DTM
TASSv22016/09/09FIT2016 E-25DTM11
Obamichikolab
DTMDTMTASSv211
2016/09/09FIT2016 E-25DTM12
Obamichikolab
12
2016/09/09FIT2016 E-25DTM13
SVM(SupportVectorMachine)TASSv2
Obamichikolab
13
SVM(SupportVectorMachine)2016/09/09FIT2016 E-25DTM14
2
Obamichikolab
SVM14
2016/09/09FIT2016 E-25DTM15
()x(1,2,3,,8):y(0,1,2,,12):mfccfx:my:mfccCx:C(C1,C2,C3,,C8)Ix:Px:(Cx)
TASS
Obamichikolab
15
2016/09/09FIT2016 E-25DTM16
()
TASS
x(1,2,3,,8):y(0,1,2,,12):mfccfx:my:mfccCx:C(C1,C2,C3,,C8)Ix:Px:(Cx)
Obamichikolab
16
2016/09/09FIT2016 E-25DTM17
()
TASS
x(1,2,3,,8):y(0,1,2,,12):mfccfx:my:mfccCx:C(C1,C2,C3,,C8)Ix:Px:(Cx)
Obamichikolab
17
2016/09/09FIT2016 E-25DTM18
()
TASS
x(1,2,3,,8):y(0,1,2,,12):mfccfx:my:mfccCx:C(C1,C2,C3,,C8)Ix:Px:(Cx)
Obamichikolab
18
2016/09/09FIT2016 E-25DTM19
2345
SVMSVM(SupportVectorMachine)[7][7], , , p. 21.
Obamichikolab
SVM()19
2016/09/09FIT2016 E-25DTM20SVM
idClassmfcc-1mfcc-2mfcc-3mfcc-4mfcc-261-1.5113130810.1441923980.1227015630.386854559-0.0121400652**1.014427781-0.64295423-0.376046747-0.1045032960.0098016063***0.739102602-0.188990891-0.642014027-0.249793604-0.01166570525*1.1828038690.8783208730.096936785-0.0210610910.028196063
C1- * *
MFCC
MFCC
Obamichikolab
SVM20
3825SVMLeave-One-Out2016/09/09FIT2016 E-25DTM21
11
24
()
123n(=25)12n(=25)
Obamichikolab
3825SVMLeave-One-Out
21
2016/09/09FIT2016 E-25DTM22
(%)ABCSVM249.566.562.059.3354.560.045.053.2450.058.041.549.8550.559.539.049.72050.551.540.547.584.241.244.455.6
Obamichikolab
22
(%)ABCSVM249.566.562.059.3354.560.045.053.2450.058.041.549.8550.559.539.049.72050.551.540.547.584.241.244.455.6
2016/09/09FIT2016 E-25DTM23
SVM50
()SVM
Obamichikolab
SVM5023
(%)ABCSVM249.566.562.059.3354.560.045.053.2450.058.041.549.8550.559.539.049.72050.551.540.547.584.241.244.455.6
2016/09/09FIT2016 E-25DTM24
SVM
Obamichikolab
SVM24
2016/09/09FIT2016 E-25DTM25
x(1,2,3,,8):C(C1,C2,C3,,C8)fx:m:mfccCx:Ix:n:Px:(Cx)Px:(Cx)
TASS
(2)
Obamichikolab
25
2016/09/09FIT2016 E-25DTM26
TASS
(2)1.
C1C3C4C6
x(1,2,3,,8):C(C1,C2,C3,,C8)fx:m:mfccCx:Ix:n:Px:(Cx)Px:(Cx)
Obamichikolab
26
2016/09/09FIT2016 E-25DTM27
TASS
(2)
x(1,2,3,,8):C(C1,C2,C3,,C8)fx:m:mfccCx:Ix:n:Px:(Cx)Px:(Cx)
Obamichikolab
.27
2016/09/09FIT2016 E-25DTM28
(%)ABCSVM2050.551.540.547.6(1)84.241.244.455.6(2)70.656.375.067.9
(2)
Obamichikolab
SVM()(2)28
2016/09/09FIT2016 E-25DTM29
Obamichikolab
29
2016/09/09FIT2016 E-25DTM30TASSTASSv2
Obamichikolab
TASSTASS V230
2016/09/09FIT2016 E-25DTM31
TASSv2TASSv2
Obamichikolab
TASS V231
[1], : , , vol. 21, no. 1, pp. 2936, Aug. 2014.[2], , DTM(DeskTop Music), presented at the 77, 2015.[3] and , DTM(DeskTop Music), MUS, vol. 2016-MUS-110, no. 15, pp. 16, Feb. 2016.[4]K. HEVNER, experimental studies of the elements of expression in music, American Journal of Psychology, vol. 48, pp. 246268, 1936.[5], Q, . [Online]. Available: http://abcpedia.acoustics.jp/acoustic_feature_2.pdf. [Accessed: 30-Dec-2015].[6], MFCC - Miyazawas Pukiwiki , Miyazawas Pukiwiki , 29-Mar-2013. [Online]. Available: http://shower.human.waseda.ac.jp/~m-kouki/pukiwiki_public/66.html. [Accessed: 07-Jul-2015].[7], , , p. 0-21.
2016/09/09FIT2016 E-25DTM32
Obamichikolab
DTM(2)2016/09/09FIT2016 E-25DTM33
Obamichikolab