HOME > 상세정보

상세정보

Speech and audio processing for coding, enhancement and recognition [electronic resource]

Speech and audio processing for coding, enhancement and recognition [electronic resource]

자료유형
E-Book(소장)
개인저자
Ogunfunmi, Tokunbo. Togneri, Roberto. Narasimha, Madihally (Sim).
서명 / 저자사항
Speech and audio processing for coding, enhancement and recognition [electronic resource] / Tokunbo Ogunfunmi, Roberto Togneri, Madihally (Sim) Narasimha, editors.
발행사항
New York :   Springer US :   Imprint: Springer,   c2015.  
형태사항
1 online resource (x, 345 p.) : col. ill.
ISBN
9781493914562
요약
This book describes the basic principles underlying the generation, coding, transmission and enhancement of speech and audio signals, including advanced statistical and machine learning techniques for speech and speaker recognition with an overview of the key innovations in these areas. Key research undertaken in speech coding, speech enhancement, speech recognition, emotion recognition and speaker diarization are also presented, along with recent advances and new paradigms in these areas. Offers readers a single-source reference on the significant applications of speech and audio processing to speech coding, speech enhancement and speech/speaker recognition. Enables readers involved in algorithm development and implementation issues for speech coding to understand the historical development and future challenges in speech coding research; Discusses speech coding methods yielding bit-streams that are multi-rate and scalable for Voice-over-IP (VoIP) Networks Presents an overview of recent developments in conversational speech coding technologies, important new algorithmic advances, and recent standardization activities in ITU-T, 3GPP, 3GPP2, MPEG and IETF that offer a significantly improved user experience during voice calls on existing and future communication systems; Presents an overview of ensemble learning efforts based on different machine learning techniques that have emerged in automatic speech recognition in recent years; Emphasizes signal processing for efficient time-domain and spectral-domain representations, reduction of noise, channel and session variabilities, extraction of temporal and spectral features for recognition and modeling; Informs readers of the latest research and developments in advanced statistical estimation and deep neural networks for speech recognition; Presents readers with the architectural framework and key approaches involved in the “hot” research areas of emotion recognition and speaker diairization systems; Provides readers with a more enriching view of state of the art research in speech enhancement arising from novel multi-microphone and time-frequency solutions.
일반주기
Title from e-Book title page.  
내용주기
From ‘Harmonic Telegraph’ to Cellular Phones -- Challenges in Speech Coding Research -- Recent Speech Coding Technologies and Standards -- Ensemble Learning Approaches in Speech Recognition -- Dynamic and Deep Networks For Speech Modeling and Recognition -- Speech Based Emotion Recognition -- Speaker Diarization: Challenges and Emerging Research -- Maximum a posteriori spectral estimation with source log-spectral priors for multichannel speech enhancement -- Modulation Processing for Speech Enhancement.
서지주기
Includes bibliographical references.
이용가능한 다른형태자료
Issued also as a book.  
일반주제명
Speech processing systems. Automatic speech recognition.
바로가기
URL
000 00000nam u2200205 a 4500
001 000046033016
005 20200619163147
006 m d
007 cr
008 200618s2015 nyua ob 000 0 eng d
020 ▼a 9781493914562
040 ▼a 211009 ▼c 211009 ▼d 211009
050 0 0 ▼a TK7882.S65
082 0 4 ▼a 006.4/54 ▼2 23
084 ▼a 006.454 ▼2 DDCK
090 ▼a 006.454
245 0 0 ▼a Speech and audio processing for coding, enhancement and recognition ▼h [electronic resource] / ▼c Tokunbo Ogunfunmi, Roberto Togneri, Madihally (Sim) Narasimha, editors.
260 ▼a New York : ▼b Springer US : ▼b Imprint: Springer, ▼c c2015.
300 ▼a 1 online resource (x, 345 p.) : ▼b col. ill.
500 ▼a Title from e-Book title page.
504 ▼a Includes bibliographical references.
505 0 ▼a From ‘Harmonic Telegraph’ to Cellular Phones -- Challenges in Speech Coding Research -- Recent Speech Coding Technologies and Standards -- Ensemble Learning Approaches in Speech Recognition -- Dynamic and Deep Networks For Speech Modeling and Recognition -- Speech Based Emotion Recognition -- Speaker Diarization: Challenges and Emerging Research -- Maximum a posteriori spectral estimation with source log-spectral priors for multichannel speech enhancement -- Modulation Processing for Speech Enhancement.
520 ▼a This book describes the basic principles underlying the generation, coding, transmission and enhancement of speech and audio signals, including advanced statistical and machine learning techniques for speech and speaker recognition with an overview of the key innovations in these areas. Key research undertaken in speech coding, speech enhancement, speech recognition, emotion recognition and speaker diarization are also presented, along with recent advances and new paradigms in these areas. Offers readers a single-source reference on the significant applications of speech and audio processing to speech coding, speech enhancement and speech/speaker recognition. Enables readers involved in algorithm development and implementation issues for speech coding to understand the historical development and future challenges in speech coding research; Discusses speech coding methods yielding bit-streams that are multi-rate and scalable for Voice-over-IP (VoIP) Networks Presents an overview of recent developments in conversational speech coding technologies, important new algorithmic advances, and recent standardization activities in ITU-T, 3GPP, 3GPP2, MPEG and IETF that offer a significantly improved user experience during voice calls on existing and future communication systems; Presents an overview of ensemble learning efforts based on different machine learning techniques that have emerged in automatic speech recognition in recent years; Emphasizes signal processing for efficient time-domain and spectral-domain representations, reduction of noise, channel and session variabilities, extraction of temporal and spectral features for recognition and modeling; Informs readers of the latest research and developments in advanced statistical estimation and deep neural networks for speech recognition; Presents readers with the architectural framework and key approaches involved in the “hot” research areas of emotion recognition and speaker diairization systems; Provides readers with a more enriching view of state of the art research in speech enhancement arising from novel multi-microphone and time-frequency solutions.
530 ▼a Issued also as a book.
538 ▼a Mode of access: World Wide Web.
650 0 ▼a Speech processing systems.
650 0 ▼a Automatic speech recognition.
700 1 ▼a Ogunfunmi, Tokunbo.
700 1 ▼a Togneri, Roberto.
700 1 ▼a Narasimha, Madihally (Sim).
856 4 0 ▼u https://oca.korea.ac.kr/link.n2s?url=http://dx.doi.org/10.1007/978-1-4939-1456-2
945 ▼a KLPA
991 ▼a E-Book(소장)

소장정보

No. 소장처 청구기호 등록번호 도서상태 반납예정일 예약 서비스
No. 1 소장처 중앙도서관/e-Book 컬렉션/ 청구기호 CR 006.454 등록번호 E14024929 도서상태 대출불가(열람가능) 반납예정일 예약 서비스 M

관련분야 신착자료

Negro, Alessandro (2026)
Dyer-Witheford, Nick (2026)