The Secret Lives of MP3 Files
![Page 1: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/1.jpg)
The Secret Lives of MP3 Files
Doug Kaye
The Conversations Network
and GigaVox Media
![Page 2: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/2.jpg)
Formats & Encoders
• Lossless (WAV, AIFF)
• Lossy
- MPEG 1, Layer 3 (MP3)
- AAC (AAC, M4A, M4B)
- MPEG 1, Layer 2 (MP2)
![Page 3: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/3.jpg)
MPEG Confusion
• Lossy Perceptual/Psychoacoustical Codecs
• MP3 = MPEG-1 Layer 3
• MP2 = MPEG-1 Layer 2 (not MPEG-2)
![Page 4: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/4.jpg)
Moving Picture Experts Group
• MPEG-1: Video CDs, MP3 Audio
• MPEG-2: Digital TV, Set-Top Boxes
• MPEG-4: Online Multimedia (Video)
• MPEG-7: Audio and Video Search
• MPEG-21: Multimedia Framework
![Page 5: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/5.jpg)
MPEG-1 for Geeks
• Layer 1
• Simple 32-Band Algorithm
• Philips DCC (Digital Compact Cassette)
• Layer 2 (a.k.a. MUSICAM)
• Also 32 Bands
• International Standard for Broadcasting
![Page 6: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/6.jpg)
MPEG-1 Layer 3 (MP3) for Geeks
• Psychoacoustic Masking
• 32 Bands Divided into 576 Frequency Lines (18 per Band)
• More Accurate Masking Thresholds
• Redundancy Reduction
• Lossless Huffman Encoding
• Bit-Reservoir Buffering
• Joint Stereo
![Page 7: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/7.jpg)
Sample Rate for Geeks
• The Nyquist Theorem
• Sample at ≥2x the Highest Frequency
• 22.05kHz Sample Rate for 11kHz Audio
• Sample Rate Is a Property of Uncompressed Source (WAV or AIFF)
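The Nyquist relationship above can be sketched as a pair of one-line helpers. This is only an illustration; the function names are hypothetical and not from the talk.

```python
def min_sample_rate_hz(highest_freq_hz: float) -> float:
    """Lowest sample rate that can capture a tone at highest_freq_hz (the 2x rule)."""
    return 2.0 * highest_freq_hz


def highest_audio_freq_hz(sample_rate_hz: float) -> float:
    """Highest audio frequency a given sample rate can represent (half the rate)."""
    return sample_rate_hz / 2.0


# 22.05kHz sampling covers audio up to about 11kHz, as on the slide.
print(highest_audio_freq_hz(22050))   # 11025.0
print(min_sample_rate_hz(11025))      # 22050.0
```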
![Page 8: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/8.jpg)
Sample Rate in Practice
• Standardize on 44.1kHz Sample Rate
• Flash & Other Players Require n*11.025kHz
• Resample if Source is 48kHz from DVDs
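A minimal sketch of the n*11.025kHz rule and the 48kHz resampling case, assuming integer sample rates in Hz. The helper names are hypothetical; a real resampler also needs an anti-aliasing filter, which is not shown here.

```python
from fractions import Fraction

BASE_RATE_HZ = 11025  # 11.025kHz, the base rate named on the slide


def is_multiple_of_base(sample_rate_hz: int) -> bool:
    """True if the rate is n * 11.025kHz for some positive integer n."""
    return sample_rate_hz > 0 and sample_rate_hz % BASE_RATE_HZ == 0


def resample_ratio(source_hz: int, target_hz: int = 44100) -> Fraction:
    """Output samples produced per input sample when converting rates."""
    return Fraction(target_hz, source_hz)


for rate in (22050, 44100, 48000):
    print(rate, is_multiple_of_base(rate))

# 48kHz DVD audio needs 147 output samples for every 160 input samples.
print(resample_ratio(48000))  # 147/160
```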
![Page 9: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/9.jpg)
Bit Rate for Geeks
• Independent of Sample Rate
• Specifies Encoder Output File Size (CBR)
• @64kbps, 1 hour ≈ 27MB
• Variable Bit Rate (VBR)
• For Higher Bit Rates Only
• Not Universally Supported (Avoid It)
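The "1 hour ≈ 27MB" figure can be checked with a little arithmetic. The sketch below assumes 1 kbps = 1000 bits per second and ignores ID3 tags and other container overhead; the function name is hypothetical.

```python
def cbr_size_bytes(bitrate_kbps: float, duration_seconds: float) -> float:
    """Approximate size of a constant-bit-rate file: bits per second * seconds / 8."""
    return bitrate_kbps * 1000 * duration_seconds / 8


one_hour = 60 * 60
size = cbr_size_bytes(64, one_hour)
print(size)                       # 28800000.0 bytes
print(round(size / 2**20, 1))     # 27.5 (in units of 1024*1024 bytes)
```

Note that the sample rate and the number of channels do not appear in the formula: for CBR, only bit rate and duration determine the size.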
![Page 10: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/10.jpg)
Bit Rate in Practice
• “Use Higher Bit Rates for Music?”
• It’s a Myth!
• Human Voices Are Complex
• Music Masks Its Own Artifacts
• 64kbps is Most Common Today
• 96kbps is Gaining
![Page 11: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/11.jpg)
Podcasting Bit-Rate History
• June 2003: 32kbps. “Files too large”
• April 2004: 48kbps. “No problem”
• September 2004: 64kbps. “Quality is low”
• Today: Still 64kbps.
• Tomorrow??
![Page 12: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/12.jpg)
Stereo Encoding
• “Stereo MP3s are twice as large as mono.”
• It’s a Myth!
• Only Bit Rate Specifies Output File Size
• You May Want to Use Higher Bit Rates for Stereo
![Page 13: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/13.jpg)
Stereo Encoding for Geeks
• Dual Channel or Independent Channel (IC)
- Entirely Separate Left and Right
• But Most L/R Information is Redundant
• Intensity Stereo (IS)
• Mid/Side Stereo (MS)
• Joint Stereo (JS) Allows IS/MS Combination
![Page 14: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/14.jpg)
Stereo Encoding (Even Geekier)
• JS Encodes L+R and L-R
• If L=R then L-R=0
• Since Bit Rate is Constant, L=R Uses Fewer Bits for Stereo Information
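The sum/difference idea can be sketched on plain lists of samples. This is an illustration only: the divide-by-two scaling is an assumption chosen to make the round trip simple, not the scaling a real MP3 encoder uses, and the function names are hypothetical.

```python
def to_mid_side(left, right):
    """Convert left/right samples to mid (sum) and side (difference) channels."""
    mid = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    return mid, side


def to_left_right(mid, side):
    """Invert to_mid_side: L = mid + side, R = mid - side."""
    left = [m + s for m, s in zip(mid, side)]
    right = [m - s for m, s in zip(mid, side)]
    return left, right


# Identical channels: the side channel is all zeros, so it costs almost nothing to encode.
mono = [0.5, -0.25, 0.75]
mid, side = to_mid_side(mono, mono)
print(side)  # [0.0, 0.0, 0.0]

# Different channels: the side channel carries the stereo information.
left, right = [0.5, -0.25], [0.25, 0.75]
mid, side = to_mid_side(left, right)
print(side)  # [0.125, -0.5]
```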
![Page 15: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/15.jpg)
Stereo Encoding in Practice
• Stereo vs. Mono (not Music vs. Voice) is a Good Reason to Use Higher Bit Rates
• Greater Separation Suggests Higher Rates
• If Mostly Speech, Consider 100% Mono
• If Mono, Make L&R Digitally Identical
• Always Encode in Stereo for Compatibility
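One way to make L and R digitally identical is to average the two channels and write the result to both. The sketch below assumes samples are held as plain lists of floats; the function name is hypothetical.

```python
def to_dual_mono(left, right):
    """Average the two channels and return the same mono signal for both L and R."""
    mono = [(l + r) / 2 for l, r in zip(left, right)]
    return mono, list(mono)


left = [0.5, -0.25, 0.0]
right = [0.25, 0.75, 0.0]
new_left, new_right = to_dual_mono(left, right)
print(new_left == new_right)  # True: the channels now match sample for sample
```

The output is still a two-channel signal, so it can be encoded as stereo for compatibility while carrying no left/right difference.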
![Page 16: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/16.jpg)
Mastering for MP3
• Help the Encoder: Eliminate Unnecessary Data
- High-Pass Filter at 80Hz
- Low-Pass Filter at 11kHz (@64kbps encoding)
- Normalize
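The two filters above can be sketched with simple one-pole (RC-style) filters. This is a stand-in for illustration: one-pole filters roll off gently, and a real mastering chain would likely use steeper filters from an audio editor or DSP library. The function names are hypothetical, and normalization is left out of this sketch.

```python
import math


def one_pole_low_pass(samples, cutoff_hz, sample_rate_hz=44100):
    """Attenuate content above cutoff_hz with a one-pole RC low-pass."""
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    dt = 1.0 / sample_rate_hz
    alpha = dt / (rc + dt)
    out, prev = [], 0.0
    for x in samples:
        prev = prev + alpha * (x - prev)  # move part of the way toward the input
        out.append(prev)
    return out


def one_pole_high_pass(samples, cutoff_hz, sample_rate_hz=44100):
    """Attenuate content below cutoff_hz (rumble, DC offset) with a one-pole RC high-pass."""
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    dt = 1.0 / sample_rate_hz
    alpha = rc / (rc + dt)
    out, prev_y, prev_x = [], 0.0, 0.0
    for i, x in enumerate(samples):
        y = x if i == 0 else alpha * (prev_y + x - prev_x)  # pass changes, decay constants
        out.append(y)
        prev_y, prev_x = y, x
    return out


def master_for_voice(samples, sample_rate_hz=44100):
    """Apply the slide's 80Hz high-pass, then the 11kHz low-pass."""
    filtered = one_pole_high_pass(samples, 80.0, sample_rate_hz)
    return one_pole_low_pass(filtered, 11000.0, sample_rate_hz)


# A constant (DC) input carries no useful audio; the high-pass removes it.
dc = [1.0] * 44100
print(abs(master_for_voice(dc)[-1]) < 1e-6)  # True
```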
![Page 17: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/17.jpg)
Which is Louder?
• It’s Not the Height of the Peaks (voltage)
• It’s the Area Under the Curve (power)
![Page 18: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/18.jpg)
Loudness
• What’s the Standard?
• We Asked:
- Podcasters
- Audio Engineers
- Radio Engineers
• Answer: There Isn’t One
• It’s a Hard Problem to Solve
![Page 19: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/19.jpg)
Normalization
• Peak Normalization (common)
- Maximizes Voltage, not Power
• RMS Normalization
- Maximizes Power (=Loudness)
• Determine a Standard Loudness Level
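The difference between the two kinds of normalization can be sketched on lists of samples in the range -1.0 to 1.0. RMS is used here as a rough stand-in for loudness, following the slide. The function names and sample values are hypothetical.

```python
import math


def peak(samples):
    """Largest absolute sample value (the height of the peaks)."""
    return max(abs(s) for s in samples)


def rms(samples):
    """Root mean square level (related to the area under the curve)."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def normalize_peak(samples, target_peak=1.0):
    """Scale so the highest peak hits target_peak."""
    p = peak(samples)
    return list(samples) if p == 0 else [s * target_peak / p for s in samples]


def normalize_rms(samples, target_rms):
    """Scale so the RMS level hits target_rms (peaks may then exceed 1.0 and clip)."""
    r = rms(samples)
    return list(samples) if r == 0 else [s * target_rms / r for s in samples]


spiky = [1.0, 0.0, 0.0, 0.0]      # one tall peak, mostly silence
steady = [0.6, -0.6, 0.6, -0.6]   # lower peaks, but sustained

print(peak(spiky), peak(steady))  # the spiky signal has the higher peak
print(rms(spiky), rms(steady))    # the steady signal has the higher RMS level
```

Peak-normalizing both signals would leave the steady one sounding louder still; RMS-normalizing both to the same target brings them to the same power level.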
![Page 20: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/20.jpg)
Avoid Recording to MP3!
• MP3 is a final/release format.
• Not designed to be decoded and re-encoded.
• Use MP2 Instead...
• or the highest MP3 bit rate possible.
![Page 21: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/21.jpg)
AAC/M4B Files?
• Yes, AAC is Better Than MP3
• We Added AAC to Support iPod Bookmarks
• Painful: Only iTunes Could Encode M4B
• Doubled Much of Our Workflow
• Can’t Be Easily Assembled
![Page 22: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/22.jpg)
MP2: Why and When?
• MPEG-1 Layer 2
• Designed as an Intermediate Format
• The Standard in Broadcast Radio
• 128kbps per Track
• 44.1kHz Sample Rate Preferred
![Page 23: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/23.jpg)
Audio Lessons Learned
• MP3 Options
• Audio-File Myths
• RMS Normalization (Loudness)
• AAC/M4B Files (iTunes & iPods)
• MP2 Files
![Page 24: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/24.jpg)
To Summarize
• Record at 44.1kHz Sample Rate (not in MP3!)
• Mastering
- RMS Normalization (Pick a Standard Level)
- 80Hz High-Pass, 11kHz Low-Pass (for voice)
- If Mono, Make L&R Digitally Identical
• Encoding
- 64kbps when L=R
- Consider ≥96kbps for L≠R
- Always Use Joint Stereo
![Page 25: The Secret Lives of MP3 Files](https://reader037.fdocuments.net/reader037/viewer/2022103014/54b4ccff4a7959fd588b4613/html5/thumbnails/25.jpg)