| 000 | 00000nam u2200205 a 4500 | |
| 001 | 000046063322 | |
| 005 | 20210112131033 | |
| 008 | 210111s2020 si a 000 0 eng d | |
| 020 | ▼a 9789811561320 | |
| 040 | ▼a 211009 ▼c 211009 ▼d 211009 | |
| 082 | 0 4 | ▼a 006.3 ▼2 23 |
| 084 | ▼a 006.3 ▼2 DDCK | |
| 090 | ▼a 006.3 ▼b Q999d | |
| 100 | 1 | ▼a Qamar, Usman. |
| 245 | 1 0 | ▼a Data science concepts and techniques with applications / ▼c Usman Qamar, Muhammad Summair Raza. |
| 260 | ▼a Singapore : ▼b Springer, ▼c 2020. | |
| 300 | ▼a xv, 196 p. : ▼b ill. (some col.) ; ▼c 25 cm. | |
| 650 | 0 | ▼a Data mining. |
| 650 | 0 | ▼a Artificial intelligence. |
| 650 | 0 | ▼a Big data. |
| 700 | 1 | ▼a Raza, Muhammad Summair. |
| 945 | ▼a KLPA |
소장정보
| No. | 소장처 | 청구기호 | 등록번호 | 도서상태 | 반납예정일 | 예약 | 서비스 |
|---|---|---|---|---|---|---|---|
| No. 1 | 소장처 과학도서관/Sci-Info(2층서고)/ | 청구기호 006.3 Q999d | 등록번호 121256062 | 도서상태 대출가능 | 반납예정일 | 예약 | 서비스 |
컨텐츠정보
책소개
This book comprehensively covers the topic of data science. Data science is an umbrella term that encompasses data analytics, data mining, machine learning, and several other related disciplines. This book synthesizes both fundamental and advanced topics of a research area that has now reached maturity. The chapters of this book are organized into three sections:
- The first section is an introduction to data science. Starting from the basic concepts, the book will highlight the types of data, its use, its importance and issues that are normally faced in data analytics. Followed by discussion on wide range of applications of data science and widely used techniques in data science.
- The second section is devoted to the tools and techniques of data science. It consists of data pre-processing, feature selection, classification and clustering concepts as well as an introduction to text mining and opining mining.
- And finally, the third section of the book focuses on two programming languages commonly used for data science projects i.e. Python and R programming language.
Although this book primarily serves as a textbook, it will also appeal to industrial practitioners and researchers due to its focus on applications and references. The book is suitable for both undergraduate and postgraduate students as well as those carrying out research in data science. It can be used as a textbook for undergraduate students in computer science, engineering and mathematics. It can also be accessible to undergraduate students from other areas with the adequate background. The more advanced chapters can be used by postgraduate researchers intending to gather a deeper theoretical understanding.
New feature
This book comprehensively covers the topic of data science. Data science is an umbrella term that encompasses data analytics, data mining, machine learning, and several other related disciplines. This book synthesizes both fundamental and advanced topics of a research area that has now reached maturity. The chapters of this book are organized into three sections:
- The first section is an introduction to data science. Starting from the basic concepts, the book will highlight the types of data, its use, its importance and issues that are normally faced in data analytics. Followed by discussion on wide range of applications of data science and widely used techniques in data science.
- The second section is devoted to the tools and techniques of data science. It consists of data pre-processing, feature selection, classification and clustering concepts as well as an introduction to text mining and opining mining.
- And finally, the third section of the book focuses on two programming languages commonly used for data science projects i.e. Python and R programming language.
Although this book primarily serves as a textbook, it will also appeal to industrial practitioners and researchers due to its focus on applications and references. The book is suitable for both undergraduate and postgraduate students as well as those carrying out research in data science. It can be used as a textbook for undergraduate students in computer science, engineering and mathematics. It can also be accessible to undergraduate students from other areas with the adequate background. The more advanced chapters can be used by postgraduate researchers intending to gather a deeper theoretical understanding.
정보제공 :
목차
Section-1: Data Science - The "What" Chapter-1: IntroductionFirst chapter will set the basic foundation of the subject for students. Like many other books, this introductory level chapter will comprise of the basic concepts. Introduction of the following concepts will be discussed:* Data Science* Importance of data science* Applications of data science* Data Driven Decision Making* Data analysisChapter-2: Widely used techniques in data scienceThis chapter will discuss the concepts required for one to start working on data analysis. Chapter will comprise of the concepts that student should know before performing any task on data analysis and some of the tasks that can be performed as part of data analysis. Following concepts will be discussed.* Supervised vs Unsupervised data* Data understanding* Data preparation* Modeling* Overfitting* Random sampling* Cross Validation* Feature selection* Outlier detection* Rule extractionSection-2: Data science: The "How" Chapter-3: Statistical InferenceEvery part of data analysis involves statistics and statistical inference to properly utilize data and perform decision making. This chapter will provide statistical concepts to support the data analysis tasks performed by students for decision making with real life data. Following topics will be discussed:* Probability theory* Transformations and expectations* Common families of distribution* Random variables* Preparation of random samples* Asymptotic evaluations* Regression and regression models Chapter-4: Supervised Learning In real world, we come across two types of data, supervised and unsupervised. In this chapter, we will discuss the concepts, tools and techniques related to processing of supervised data with examples and decision making out of it. The following concepts will be discussed:* Supervised Learning* Classification and Regression* Generalization, Overfitting and Underfitting* Evaluation models* Supervised learning algorithmsChapter-5: Unsupervised LearningThe unsupervised data forms the other half of the data available in real world applications. Like previous chapter, this chapter will include the concepts, tools and techniques related to unsupervised data with examples. Following contents will be included:* Challenges of unsupervised learning* Processing and scaling* Clustering* Dimensionality reduction, feature extraction and manifold learning* Unsupervised learning algorithmsChapter-6: Natural language processingIn this chapter, we will focus on one particular sort of data that has become extremely common i.e. text data. We will see in this chapter the fundamental principles of natural language processing and will look at one of the common application of NLP that is sentiment analysis. Following contents will be discussed:* Why Text Is Important* Why Text Is Difficult* Representation* Sentiment Analysis* Lexicon-based Approaches for Text MiningSection-3: Data Science - The "Where" Chapter-7: Customers AnalyticsIn this chapter, we will introduce he use of analytics for understanding customers and predicting their behaviour in different situations. This includes the understanding of loyalty programs, market research, understanding customer lifetime value, predicting churn, and identifying potential defaulters. These are few examples of what will be contained in this chapter. Chapter-8: Operations AnalyticsIn this chapter, we will prepare our readers to understand and acknowledge the use of data science for improving business operations. For example, we will discuss how analyzing data can help avoid service outages, or at least predict the service outage in order to prepare contingency plans. Analyzing data can also help in identifying redundancies which can be removed in order to significantly reduce operational costs. We will give examples on how various manufacturing and service industries are using real-time sensor data to track their systems wear and tear. This helps them improve their mean time to repair by forecasting breakdown of different components well ahead in time.
