Wiley on Methods and Applications in Data Mining
ebook
(0)
Practical Text Mining With Perl
by Roger Bilisoly
Part 2 of the Wiley on Methods and Applications in Data Mining series
Provides readers with the methods, algorithms, and means to perform text mining tasks.
This book is devoted to the fundamentals of text mining using Perl, an open-source programming tool that is freely available via the Internet (www.perl.org). It covers mining ideas from several perspectives-statistics, data mining, linguistics, and information retrieval-and provides readers with the means to successfully complete text mining tasks on their own.
The book begins with an introduction to regular expressions, a text pattern methodology, and quantitative text summaries, all of which are fundamental tools of analyzing text. Then, it builds upon this foundation to explore:
• Probability and texts, including the bag-of-words model
• Information retrieval techniques such as the TF-IDF similarity measure
• Concordance lines and corpus linguistics
• Multivariate techniques such as correlation, principal components analysis, and clustering
• Perl modules, German, and permutation tests
Each chapter is devoted to a single key topic, and the author carefully and thoughtfully introduces mathematical concepts as they arise, allowing readers to learn as they go without having to refer to additional books. The inclusion of numerous exercises and worked-out examples further complements the book's student-friendly format.
“Practical Text Mining with Perl” is ideal as a textbook for undergraduate and graduate courses in text mining and as a reference for a variety of professionals who are interested in extracting information from text documents.
ebook
(0)
Knowledge Discovery With Support Vector Machines
by Lutz H. Hamel
Part 3 of the Wiley on Methods and Applications in Data Mining series
An easy-to-follow introduction to support vector machines.
This book provides an in-depth, easy-to-follow introduction to support vector machines drawing only from minimal, carefully motivated technical and mathematical background material. It begins with a cohesive discussion of machine learning and goes on to cover:
• Knowledge discovery environments
• Describing data mathematically
• Linear decision surfaces and functions
• Perceptron learning
• Maximum margin classifiers
• Support vector machines
• Elements of statistical learning theory
• Multi-class classification
• Regression with support vector machines
• Novelty detection
Complemented with hands-on exercises, algorithm descriptions, and data sets, “Knowledge Discovery with Support Vector Machines” is an invaluable textbook for advanced undergraduate and graduate courses. It is also an excellent tutorial on support vector machines for professionals who are pursuing research in machine learning and related areas.
ebook
(0)
Pattern Recognition
A Quality of Data Perspective
by Wladyslaw Homenda
Part of the Wiley on Methods and Applications in Data Mining series
A new approach to the issue of data quality in pattern recognition.
Detailing foundational concepts before introducing more complex methodologies and algorithms, this book is a self-contained manual for advanced data analysis and data mining. Top-down organization presents detailed applications only after methodological issues have been mastered, and step-by-step instructions help ensure successful implementation of new processes. By positioning data quality as a factor to be dealt with rather than overcome, the framework provided serves as a valuable, versatile tool in the analysis arsenal.
For decades, practical need has inspired intense theoretical and applied research into pattern recognition for numerous and diverse applications. Throughout, the limiting factor and perpetual problem has been data-its sheer diversity, abundance, and variable quality presents the central challenge to pattern recognition innovation. Pattern Recognition: A Quality of Data Perspective repositions that challenge from a hurdle to a given and presents a new framework for comprehensive data analysis that is designed specifically to accommodate problem data.
Designed as both a practical manual and a discussion about the most useful elements of pattern recognition innovation, this book:
• Details fundamental pattern recognition concepts, including feature space construction, classifiers, rejection, and evaluation
• Provides a systematic examination of the concepts, design methodology, and algorithms involved in pattern recognition
• Includes numerous experiments, detailed schemes, and more advanced problems that reinforce complex concepts
• Acts as a self-contained primer toward advanced solutions, with detailed background and step-by-step processes
• Introduces the concept of granules and provides a framework for granular computing
Pattern recognition plays a pivotal role in data analysis and data mining, fields which are themselves being applied in an expanding sphere of utility. By facing the data quality issue head-on, this book provides students, practitioners, and researchers with a clear way forward amidst the ever-expanding data supply.
ebook
(0)
Data Science Using Python and R
by Chantal D. Larose
Part of the Wiley on Methods and Applications in Data Mining series
Learn data science by doing data science!
Data Science Using Python and R will get you plugged into the world's two most widespread open-source platforms for data science: Python and R.
Data science is hot. Bloomberg called data scientist "the hottest job in America." Python and R are the top two open-source data science tools in the world. In “Data Science Using Python and R”, you will learn step-by-step how to produce hands-on solutions to real-world business problems, using state-of-the-art techniques.
“Data Science Using Python and R” is written for the general reader with no previous analytics or programming experience. An entire chapter is dedicated to learning the basics of Python and R. Then, each chapter presents step-by-step instructions and walkthroughs for solving data science problems using Python and R.
Those with analytics experience will appreciate having a one-stop shop for learning how to do data science using Python and R. Topics covered include data preparation, exploratory data analysis, preparing to model the data, decision trees, model evaluation, misclassification costs, naïve Bayes classification, neural networks, clustering, regression modeling, dimension reduction, and association rules mining.
Further, exciting new topics such as random forests and general linear models are also included. The book emphasizes data-driven error costs to enhance profitability, which avoids the common pitfalls that may cost a company millions of dollars.
“Data Science Using Python and R” provides exercises at the end of every chapter, totaling over 500 exercises in the book. Readers will therefore have plenty of opportunity to test their newfound data science skills and expertise. In the Hands-on Analysis exercises, readers are challenged to solve interesting business problems using real-world data sets.
ebook
(0)
Discovering Knowledge in Data
An Introduction to Data Mining
by Daniel T. Larose
Part of the Wiley on Methods and Applications in Data Mining series
The field of data mining lies at the confluence of predictive analytics, statistical analysis, and business intelligence. Due to the ever-increasing complexity and size of data sets and the wide range of applications in computer science, business, and health care, the process of discovering knowledge in data is more relevant than ever before. This book provides the tools needed to thrive in today's big data world. The author demonstrates how to leverage a company's existing databases to increase profits and market share, and carefully explains the most current data science methods and techniques. The reader will "learn data mining by doing data mining". By adding chapters on data modelling preparation, imputation of missing data, and multivariate statistical analysis, “Discovering Knowledge in Data”, Second Edition remains the eminent reference on data mining.
• The second edition of a highly praised, successful reference on data mining, with thorough coverage of big data applications, predictive analytics, and statistical analysis.
• Includes new chapters on Multivariate Statistics, Preparing to Model the Data, and Imputation of Missing Data, and an Appendix on Data Summarization and Visualization
• Offers extensive coverage of the R statistical programming language
• Contains 280 end-of-chapter exercises
• Includes a companion website for university instructors who adopt the book
ebook
(0)
Data Mining and Learning Analytics
Applications in Educational Research
by Various Authors
Part of the Wiley on Methods and Applications in Data Mining series
Addresses the impacts of data mining on education and reviews applications in educational research teaching, and learning.
This book discusses the insights, challenges, issues, expectations, and practical implementation of data mining (DM) within educational mandates. Initial series of chapters offer a general overview of DM, Learning Analytics (LA), and data collection models in the context of educational research, while also defining and discussing data mining's four guiding principles-prediction, clustering, rule association, and outlier detection. The next series of chapters showcase the pedagogical applications of Educational Data Mining (EDM) and feature case studies drawn from Business, Humanities, Health Sciences, Linguistics, and Physical Sciences education that serve to highlight the successes and some of the limitations of data mining research applications in educational settings. The remaining chapters focus exclusively on EDM's emerging role in helping to advance educational research-from identifying at-risk students and closing socioeconomic gaps in achievement to aiding in teacher evaluation and facilitating peer conferencing. This book features contributions from international experts in a variety of fields.
• Includes case studies where data mining techniques have been effectively applied to advance teaching and learning
• Addresses applications of data mining in educational research, including: social networking and education; policy and legislation in the classroom; and identification of at-risk students
• Explores Massive Open Online Courses (MOOCs) to study the effectiveness of online networks in promoting learning and understanding the communication patterns among users and students
• Features supplementary resources including a primer on foundational aspects of educational mining and learning analytics
“Data Mining and Learning Analytics: Applications in Educational Research” is written for both scientists in EDM and educators interested in using and integrating DM and LA to improve education and advance educational research.
Showing 1 to 6 of 6 results