l Machine Learning l Speech separation l Recommender system 2.
-
Upload
david-cross -
Category
Documents
-
view
231 -
download
6
Transcript of l Machine Learning l Speech separation l Recommender system 2.
![Page 1: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/1.jpg)
![Page 2: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/2.jpg)
Machine Learning Speech separation Recommender system
2
![Page 3: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/3.jpg)
“ 机器学习是一门人工智能的科学,该领域的主要研究对象是人工智能,特别是如何在经验学习中改善具体算法的性能”。
“ 机器学习是对能通过经验自动改进的计算机算法的研究”。
A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.
3
![Page 4: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/4.jpg)
4
![Page 5: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/5.jpg)
Machine perception/Speech recognition/Computer vision/Natural language processing/Sentiment Analysis
Search engines Medical diagnosis Bioinfomatics/Cheminformatics Anomaly detection/Detecting credit card fraud Time series modeling/Stock market
analysis/Computational finance Game playing Information retrieval/Recommender systems Social Networks/Big data
5
![Page 6: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/6.jpg)
Machine Learning Speech separation Recommender system
6
![Page 7: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/7.jpg)
In a daily environment, target speech is often corrupted by various types of acoustic interference
How to remove or attenuate background noise?
Much more difficult than binaural (multichannel) speech separation.
7
![Page 8: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/8.jpg)
8
![Page 9: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/9.jpg)
Noisy speech Time-frequency representation
A main goal: Ideal binary mask (IBM) The definition of the ideal binary mask
It is a classification task: Given noisy speech Binary matrix
Method: Supervised learning
otherwise0
f),localSNR(t1),(
ftm
9
![Page 10: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/10.jpg)
10
![Page 11: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/11.jpg)
• Female Speech + Factory Noise (0dB, 64-channel)
• Noisy speech
• IBM
• Proposed
![Page 12: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/12.jpg)
Wang et al. (2008)
12
96 dB
72 dB
48 dB
24 dB
0 dB
![Page 13: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/13.jpg)
Machine Learning Speech separation Recommender system
13
![Page 14: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/14.jpg)
问题 : 预测一个人给一部电影的评分 用户: 480,189 电影: 17,770
14
仅有 100 million ratings 实际 8.5 billion potential ratings 99% 未评分!
![Page 15: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/15.jpg)
三年期限,第一个超过 Netflix 算法 10% 的获奖 2009 年 9 月, AT&T 小组 “ BellKor’s
Pragmatic Chaos” 获得大奖 $1,000,000
15
• 如何预测?• 相似用户评分相似• 相似电影评分相似• Latent factor 隐
变量
![Page 16: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/16.jpg)
16
![Page 17: l Machine Learning l Speech separation l Recommender system 2.](https://reader035.fdocuments.net/reader035/viewer/2022062315/5697c01e1a28abf838cd0df6/html5/thumbnails/17.jpg)
Amazon: Facebook: LinkedIn: Last.fm: 豆瓣,虾米,人人, QQ…………………….