HOME > 상세정보

상세정보

Text analytics with Python : a practitioner's guide to natural language processing / 2nd ed

Text analytics with Python : a practitioner's guide to natural language processing / 2nd ed (3회 대출)

자료유형
단행본
개인저자
Sarkar, Dipanjan.
서명 / 저자사항
Text analytics with Python : a practitioner's guide to natural language processing / Dipanjan Sarkar.
판사항
2nd ed.
발행사항
[S.l.] :   Apress,   c2019.  
형태사항
xxiv, 674 p. : ill. ; 26 cm.
ISBN
9781484243534
일반주기
Includes index.  
000 00000nam u2200205 a 4500
001 000046032498
005 20200615162434
008 200615s2019 xx a 001 0 eng d
020 ▼a 9781484243534
040 ▼a 211009 ▼c 211009 ▼d 211009
082 0 4 ▼a 005.133 ▼2 23
084 ▼a 005.133 ▼2 DDCK
090 ▼a 005.133 ▼b S245t2
100 1 ▼a Sarkar, Dipanjan.
245 1 0 ▼a Text analytics with Python : ▼b a practitioner's guide to natural language processing / ▼c Dipanjan Sarkar.
250 ▼a 2nd ed.
260 ▼a [S.l.] : ▼b Apress, ▼c c2019.
300 ▼a xxiv, 674 p. : ▼b ill. ; ▼c 26 cm.
500 ▼a Includes index.
945 ▼a KLPA

소장정보

No. 소장처 청구기호 등록번호 도서상태 반납예정일 예약 서비스
No. 1 소장처 중앙도서관/서고6층/ 청구기호 005.133 S245t2 등록번호 111829568 (3회 대출) 도서상태 대출가능 반납예정일 예약 서비스 B M

컨텐츠정보

책소개

Leverage Natural Language Processing (NLP) in Python and learn how to set up your own robust environment for performing text analytics. This second edition has gone through a major revamp and introduces several significant changes and new topics based on the recent trends in NLP. 

You’ll see how to use the latest state-of-the-art frameworks in NLP, coupled with machine learning and deep learning models for supervised sentiment analysis powered by Python to solve actual case studies. Start by reviewing Python for NLP fundamentals on strings and text data and move on to engineering representation methods for text data, including both traditional statistical models and newer deep learning-based embedding models. Improved techniques and new methods around parsing and processing text are discussed as well.   

Text summarization and topic models have been overhauled so the book showcases how to build, tune, and interpret topic models in the context of an interest dataset on NIPS conference papers. Additionally, the book covers text similarity techniques with a real-world example of movie recommenders, along with sentiment analysis using supervised and unsupervised techniques.

There is also a chapter dedicated to semantic analysis where you’ll see how to build your own named entity recognition (NER) system from scratch. While the overall structure of the book remains the same, the entire code base, modules, and chapters has been updated to the latest Python 3.x release.


What You'll Learn

? Understand NLP and text syntax, semantics and structure
? Discover text cleaning and feature engineering
? Review text classification and text clustering 
? Assess text summarization and topic models
? Study deep learning for NLP

Who This Book Is For

IT professionals, data analysts, developers, linguistic experts, data scientists and engineers and basically anyone with a keen interest in linguistics, analytics and generating insights from textual data.


New feature

Leverage Natural Language Processing (NLP) in Python and learn how to set up your own robust environment for performing text analytics. The second edition of this book will show you how to use the latest state-of-the-art frameworks in NLP, coupled with Machine Learning and Deep Learning to solve real-world case studies leveraging the power of Python.

This edition has gone through a major revamp introducing several major changes and new topics based on the recent trends in NLP. We have a dedicated chapter around Python for NLP covering fundamentals on how to work with strings and text data along with introducing the current state-of-the-art open-source frameworks in NLP. We have a dedicated chapter on feature engineering representation methods for text data including both traditional statistical models and newer deep learning based embedding models. Techniques around parsing and processing text data have also been improved with some new methods.

Considering popular NLP applications, for text classification, we also cover methods for tuning and improving our models. Text Summarization has gone through a major overhaul in the context of topic models where we showcase how to build, tune and interpret topic models in the context of an interest dataset on NIPS conference papers. Similarly, we cover text similarity techniques with a real-world example of movie recommenders. Sentiment Analysis is covered in-depth with both supervised and unsupervised techniques. We also cover both machine learning and deep learning models for supervised sentiment analysis. Semantic Analysis gets its own dedicated chapter where we also showcase how you can build your own Named Entity Recognition (NER) system from scratch. To conclude things, we also have a completely new chapter on the promised of Deep Learning for NLP where we also showcase a hands-on example on deep transfer learning.

While the overall structure of the book remains the same, the entire code base, modules, and chapters will be updated to the latest Python 3.x release.
----------------------------------
Also the key selling points
? Implementations are based on Python 3.x and state-of-the-art popular open source libraries in NLP 
? Covers Machine Learning and Deep Learning for Advanced Text Analytics and NLP
? Showcases diverse NLP applications including Classification, Clustering, Similarity Recommenders, Topic Models, Sentiment and Semantic Analysis




정보제공 : Aladin

저자소개

디판잔 사카(지은이)

세계 최대의 반도체 회사인 인텔에서 애널리틱스, 비즈니스 인텔리전스, 애플리케이션 개발 업무를 수행하는 IT 엔지니어다. 인도 방갈로르의 국제정보기술공대 IT 학과에서 석사 학위를 받았으며, 소프트웨어 엔지니어링, 데이터 과학, 머신 러닝, 텍스트 애널리틱스가 전문 영역이다. 새로운 기술을 배우는 것을 포함해, 혁신적인 스타트업들과 데이터 과학에 관심을 가지고 있다. 책을 읽고, 게임을 하고, 유명한 시트콤을 보는 것을 좋아한다. 팩트출판사가 펴낸 『Data Analysis with R』, 『Learning R for Geospatial Analysis』, 『R Data Analysis Cookbook』의 감수자이기도 하다.

정보제공 : Aladin

목차

Chapter 1: Natural Language BasicsChapter Goal: Introduces the readers to the basics of NLP and Text processingNo of pages: 40 - 50 Sub -Topics1. Language Syntax and Structure2. Text formats and grammars3. Lexical and Text Corpora resources4. Deep dive into the Wordnet corpus5. Parts of speech, Stemming and lemmatization
Chapter 2: Python for Natural Language ProcessingChapter Goal: A useful chapter for people focusing on how to setup your own python environment for NLP and also some basics on handling text data with python and coverage of popular open source frameworks for NLPNo of pages: 20 - 30Sub - Topics 1. Setup Python for NLP2. Handling strings with Python3. Regular Expressions with Python4. Quick glance into nltk, gensim, spacy, scikit-learn, keras
Chapter 3: Processing and Understanding TextChapter Goal: This chapter covers all the techniques and capabilities needed for processing and parsing text into easy to understand formats. We also look at how to segment and normalize text. No of pages : 35 - 40Sub - Topics: 1. Sentence and word tokenization2. Text tagging and chunking3. Text Parse Trees3. Text normalization4. Text spell checks and removal of redundant characters5. Synonyms and Synsets
Chapter 4: Feature Engineering for Text DataChapter Goal: This chapter covers important strategies to extract meaningful features from unstructured text data. This includes traditional techniques as well as newer deep learning based methods. No of pages : 40 - 50Sub - Topics: 1. Feature engineering strategies for text data2. Bag of words model3. TF-IDF model3. Bag of N-grams model4. Topic Models5. Word Embedding based models (word2vec, glove)
Chapter 5: Text Classification
Chapter Goal: Introduces readers to the concept of classification as a supervised machine learning problem and looks at a real world example for classifying text documentsNo of pages: 30 - 40Sub - Topics: 1. Classification basics2. Types of classifiers3. Feature generation of text documents4. Binary and multi-class classification models5. Building a text classifier on real world data with machine learning6. Some coverage of deep learning based classifiers7. Evaluating Classifiers
Chapter 6: Text summarization and topic modelingChapter Goal: Introduces the concepts of text summarization, n-gram tagging analysis and topic models to the readers and looks at some real world datasets and hands-on implementations on the sameNo of pages: 40 - 45Sub - Topics: 1. Text summarization concepts2. Dimensionality reduction3. N-gram tagging models4. Topic modeling using LDA and LSA5. Generate topics from real world data6. N-gram analysis to generate patterns from app reviews (only if it performs well)7. Basics on deep learning for summarization

Chapter 7: Text Clustering and Similarity analysisChapter Goal: We look at unsupervised machine learning concepts here like text clustering and similarity measuresNo of pages: 35 - 40Sub - Topics: 1. Clustering concepts2. Analyzing text similarity3. Implementing text similarity with cosine, jaccard measures4. Text clustering algorithms5. Coverage of partition based clustering like k-means clustering as well as hierarchical clustering methods in detail 6. Hands on text clustering example on real world data
Chapter 8: Sentiment Analysis Chapter Goal: We look at solving a popular problem of analyzing sentiment from text using a combination of methods learnt earlier including classification and also lexical analysisNo of pages: 35 - 40Sub - Topics: 1. What is sentiment analysis2. Looking at lexical corpora for sentiment 3. Unsupervised sentiment analysis using lexical methods (hands-on)4. Supervised sentiment analysis (hands-on)
Chapter 9: Deep learning in NLPChapter Goal: Deep Learning is one of the most trending topics in the machine learning and data science space these days. Here we will cover a brief introduction into the promise deep learning holds for text analytics and NLP.No of pages: 30 - 35Sub - Topics: 1. What is Deep Learning2. Deep learning for text classification (concepts only)3. Deep learning for natural language generation (concepts only)4. Deep learning for text summarization (concepts only)

관련분야 신착자료

Harvard Business Review (2025)