HOME > 상세정보

상세정보

Audio content analysis : an introduction : applications in signal processing and music informatics

Audio content analysis : an introduction : applications in signal processing and music informatics (3회 대출)

자료유형
단행본
개인저자
Lerch, Alexander.
서명 / 저자사항
Audio content analysis : an introduction : applications in signal processing and music informatics / Alexander Lerch.
발행사항
Hoboken, N.J. :   Wiley,   c2012.  
형태사항
xxii, 248 p. : ill. ; 26 cm.
ISBN
9781118266823 (hardback)
요약
"With the proliferation of digital audio distribution over digital media, audio content analysis is fast becoming a requirement for designers of intelligent signal-adaptive audio processing systems. Written by a well-known expert in the field, this book provides quick access to different analysis algorithms and allows comparison between different approaches to the same task, making it useful for newcomers to audio signal processing and industry experts alike. A review of relevant fundamentals in audio signal processing, psychoacoustics, and music theory, as well as downloadable MATLAB files are also included"--
서지주기
Includes bibliographical references (p. 207-242) and index.
일반주제명
Computer sound processing. Computational auditory scene analysis. Content analysis (Communication) --Data processing.
000 00000cam u2200205 a 4500
001 000045872291
005 20190228154746
008 160520s2012 njua b 001 0 eng d
010 ▼a 2012008107
020 ▼a 9781118266823 (hardback)
035 ▼a (KERIS)REF000016774858
040 ▼a DLC ▼c DLC ▼d DLC ▼d 211009
050 0 0 ▼a TK7881.4 ▼b .L485 2012
082 0 0 ▼a 006.4/5 ▼2 23
084 ▼a 006.45 ▼2 DDCK
090 ▼a 006.45 ▼b L614a
100 1 ▼a Lerch, Alexander.
245 1 0 ▼a Audio content analysis : ▼b an introduction : applications in signal processing and music informatics / ▼c Alexander Lerch.
260 ▼a Hoboken, N.J. : ▼b Wiley, ▼c c2012.
300 ▼a xxii, 248 p. : ▼b ill. ; ▼c 26 cm.
504 ▼a Includes bibliographical references (p. 207-242) and index.
520 ▼a "With the proliferation of digital audio distribution over digital media, audio content analysis is fast becoming a requirement for designers of intelligent signal-adaptive audio processing systems. Written by a well-known expert in the field, this book provides quick access to different analysis algorithms and allows comparison between different approaches to the same task, making it useful for newcomers to audio signal processing and industry experts alike. A review of relevant fundamentals in audio signal processing, psychoacoustics, and music theory, as well as downloadable MATLAB files are also included"-- ▼c Provided by publisher.
650 0 ▼a Computer sound processing.
650 0 ▼a Computational auditory scene analysis.
650 0 ▼a Content analysis (Communication) ▼x Data processing.
945 ▼a KLPA

소장정보

No. 소장처 청구기호 등록번호 도서상태 반납예정일 예약 서비스
No. 1 소장처 과학도서관/Sci-Info(2층서고)/ 청구기호 006.45 L614a 등록번호 121236604 (2회 대출) 도서상태 대출가능 반납예정일 예약 서비스 B M
No. 2 소장처 과학도서관/Sci-Info(2층서고)/ 청구기호 006.45 L614a 등록번호 121248062 (1회 대출) 도서상태 대출가능 반납예정일 예약 서비스 B M

컨텐츠정보

책소개

With the proliferation of digital audio distribution over digital media, audio content analysis is fast becoming a requirement for designers of intelligent signal-adaptive audio processing systems. Written by a well-known expert in the field, this book provides quick access to different analysis algorithms and allows comparison between different approaches to the same task, making it useful for newcomers to audio signal processing and industry experts alike. A review of relevant fundamentals in audio signal processing, psychoacoustics, and music theory, as well as downloadable MATLAB files are also included.

Please visit the companion website: www.AudioContentAnalysis.org



New feature

An easily accessible, hands-on approach to digital audio signal processing

With the proliferation of digital audio distribution over digital media, the amount of easily accessible music is ever-growing, requiring new tools for navigating, accessing, and retrieving music in meaningful ways. An understanding of audio content analysis is essential for the design of intelligent music information retrieval applications and content-adaptive audio processing systems.

This book is about how to teach a computer to interpret music signals, thus allowing the design of tools for interacting with music. This book serves as a comprehensive guide on audio content analysis and how to apply it in signal processing and music informatics. Written by a well-known expert in the music industry, An Introduction to Audio Content Analysis ties together topics from audio signal processing and machine learning, showing how to use audio content analysis to pick up musical characteristics automatically. The author clearly explains the analysis of audio signals and the extraction of metadata describing the content of the signal, covering both abstract descriptions of technical properties and musical descriptions such as tempo, harmony and key, musical style, and performance attributes. Musical information is given a separate analysis in each category, whether tonal, pitch, harmony, key, temporal, or tempo, among others.

Readers will get access to various analysis algorithms and learn to compare different standard approaches to the same task. The book includes a review of the fundamentals of audio signal processing, psychoacoustics, and music theory.

An invaluable guide for newcomers to audio signal processing and industry experts alike, An Introduction to Audio Content Analysis also features downloadable MATLAB files from a companion website, www.AudioContentAnalysis.org, lists of abbreviations and symbols, and references.




정보제공 : Aladin

목차

List of Figures xiii

List of Tables xvii

Preface xix

Acronyms xxi

List of Symbols xxv

1 Introduction 1

1.1 Audio Content 3

1.2 A Generalized Audio Content Analysis System 4

2 Fundamentals 7

2.1 Audio Signals 7

2.1.1 Periodic Signals 7

2.1.2 Random Signals 9

2.1.3 Sampling and Quantization 9

2.1.4 Statistical Signal Description 13

2.2 Signal Processing 14

2.2.1 Convolution 14

2.2.2 BlockBased Processing 18

2.2.3 Fourier Transform 20

2.2.4 Constant Q Transform 23

2.2.5 Auditory Filterbanks 24

2.2.6 Correlation Function 24

2.2.7 Linear Prediction 28

3 Instantaneous Features 31

3.1 Audio PreProcessing 33

3.1.1 DownMixing 33

3.1.2 DC Removal 33

3.1.3 Normalization 34

3.1.4 DownSampling 34

3.1.5 Other PreProcessing Options 35

3.2 Statistical Properties 35

3.2.1 Arithmetic Mean 36

3.2.2 Geometric Mean 36

3.2.3 Harmonic Mean 36

3.2.4 Generalized Mean 36

3.2.5 Centroid 37

3.2.6 Variance and Standard Deviation 37

3.2.7 Skewness 38

3.2.8 Kurtosis 39

3.2.9 Generalized Central Moments 40

3.2.10 Quantiles and Quantile Ranges 40

3.3 Spectral Shape 41

3.3.1 Spectral Rolloff 42

3.3.2 Spectral Flux 44

3.3.3 Spectral Centroid 45

3.3.4 Spectral Spread 47

3.3.5 Spectral Decrease 48

3.3.6 Spectral Slope 49

3.3.7 Mel Frequency Cepstral Coefficients 51

3.4 Signal Properties 54

3.4.1 Tonalness 54

3.4.2 Auto Correlation Coefficients 61

3.4.3 Zero Crossing Rate 62

3.5 Feature PostProcessing 63

3.5.1 Derived Features 64

3.5.2 Normalization and Mapping 65

3.5.3 Subfeatures 66

3.5.4 Feature Dimensionality Reduction 66

4 Intensity 71

4.1 Human Perception of Intensity and Loudness 71

4.2 Representation of Dynamics in Music 73

4.3 Features 73

4.3.1 Root Mean Square 73

4.4 Peak Envelope 76

4.5 PsychoAcoustic Loudness Features 77

4.5.1 EBU R128 78

5 Tonal Analysis 79

5.1 Human Perception of Pitch 79

5.1.1 Pitch Scales 79

5.1.2 Chroma Perception 81

5.2 Representation of Pitch in Music 82

5.2.1 Pitch Classes and Names 82

5.2.2 Intervals 83

5.2.3 Root Note, Mode, and Key 83

5.2.4 Chords and Harmony 86

5.2.5 The Frequency of Musical Pitch 88

5.3 Fundamental Frequency Detection 91

5.3.1 Detection Accuracy 92

5.3.2 PreProcessing 94

5.3.3 Monophonic Input Signals 97

5.3.4 Polyphonic Input Signals 103

5.4 Tuning Frequency Estimation 106

5.5 Key Detection 108

5.5.1 Pitch Chroma 108

5.5.2 Key Recognition 112

5.6 Chord Recognition 116

6 Temporal Analysis 119

6.1 Human Perception of Temporal Events 119

6.1.1 Onsets 119

6.1.2 Tempo and Meter 122

6.1.3 Rhythm 122

6.1.4 Timing 123

6.2 Representation of Temporal Events in Music 123

6.2.1 Tempo and Time Signature 123

6.2.2 Note Value 124

6.3 Onset Detection 124

6.3.1 Novelty Function 125

6.4 Beat Histogram 133

6.4.1 Beat Histogram Features 134

6.5 Detection of Tempo and Beat Phase 135

6.6 Detection of Meter and Downbeat 136

7 Alignment 139

7.1 Dynamic Time Warping 139

7.1.1 Example 143

7.1.2 Common Variants 144

7.1.3 Optimizations 145

7.2 AudiotoAudio Alignment 146

7.2.1 Ground Truth Data for Evaluation 147

7.3 AudiotoScore Alignment 148

7.3.1 RealTime Systems 148

7.3.2 Non RealTime Systems 149

8 Musical Genre, Similarity and Mood 151

8.1 Musical Genre Classification 151

8.1.1 Musical Genre 152

8.1.2 Feature Extraction 154

8.1.3 Classification 155

8.2 Related Research Fields 156

8.2.1 Music Similarity Detection 156

8.2.2 Mood Classification 158

8.2.3 Instrument Recognition 161

9 Audio Fingerprinting 163

9.1 Fingerprint Extraction 164

9.2 Fingerprint Matching 165

9.3 Fingerprinting System: Example 166

10 Music Performance Analysis 169

10.1 Musical Communication 169

10.1.1 Score 169

10.1.2 Music Performance 170

10.1.3 Production 172

10.1.4 Recipient 172

10.2 Music Performance Analysis 172

10.2.1 Analysis Data 174

10.2.2 Research Results 177

A Convolution Properties 181

A.1 Identity 181

A.2 Commutativity 181

A.3 Associativity 182

A.4 Distributivity 183

A.5 Circularity 183

B Fourier Transform 185

B.1 Properties of the Fourier Transformation 186

B.1.1 Inverse Fourier Transform 186

B.1.2 Superposition 186

B.1.3 Convolution and Multiplication 186

B.1.4 Parseval’s Theorem 187

B.1.5 Time and Frequency Shift 188

B.1.6 Symmetry 188

B.1.7 Time and Frequency Scaling 189

B.1.8 Derivatives 190

B.2 Spectrum of Example Time Domain Signals 190

B.2.1 Delta Function 190

B.2.2 Constant 190

B.2.3 Cosine 190

B.2.4 Rectangular Window 191

B.2.5 Delta Pulse 191

B.3 Transformation of sampled time signals 191

B.4 Short Time Fourier Transform of Continuous Signals 192

B.4.1 Window Functions 193

B.5 Discrete Fourier Transform 195

B.5.1 Window Functions 196

B.5.2 Fast Fourier Transform 197

C Principal Component Analysis 199

C.1 Computation of the Transformation Matrix 200

C.2 Interpretation of the Transformation Matrix 200

D Software for Audio Analysis 201

D.1 Software Frameworks & Applications 202

D.1.1 Marsyas 202

D.1.2 CLAM 202

D.1.3 jMIR 203

D.1.4 CoMIRVA 203

D.1.5 Sonic Visualiser 203

D.2 Software Libraries & Toolboxes 204

D.2.1 Feature Extraction 204

D.2.2 Plugin Interfaces 205

D.2.3 Other Software 206

References 207

Index 237


정보제공 : Aladin

관련분야 신착자료

Negro, Alessandro (2026)
Dyer-Witheford, Nick (2026)
양성봉 (2025)