Description
Key Features
Master popular machine learning models including k-nearest neighbors, random forests, logistic regression, k-means, naive Bayes, and artificial neural networks
Learn how to build and evaluate performance of efficient models using scikit-learn
Practical guide to master your basics and learn from real life applications of machine learning
Book Description
Machine learning is the buzzword bringing computer science and statistics together to build smart and efficient models. Using powerful algorithms and techniques offered by machine learning you can automate any analytical model.
This book examines a variety of machine learning models including popular machine learning algorithms such as k-nearest neighbors, logistic regression, naive Bayes, k-means, decision trees, and artificial neural networks. It discusses data preprocessing, hyperparameter optimization, and ensemble methods. You will build systems that classify documents, recognize images, detect ads, and more. You will learn to use scikit-learn’s API to extract features from categorical variables, text and images; evaluate model performance, and develop an intuition for how to improve your model’s performance.
By the end of this book, you will master all required concepts of scikit-learn to build efficient models at work to carry out advanced tasks with the practical approach.
What you will learn
Review fundamental concepts such as bias and variance
Extract features from categorical variables, text, and images
Predict the values of continuous variables using linear regression and K Nearest Neighbors
Classify documents and images using logistic regression and support vector machines
Create ensembles of estimators using bagging and boosting techniques
Discover hidden structures in data using K-Means clustering
Evaluate the performance of machine learning systems in common tasks
About the Author
Gavin Hackeling is a data scientist and author. He was worked on a variety of machine learning problems, including automatic speech recognition, document classification, object recognition, and semantic segmentation. An alumnus of the University of North Carolina and New York University, he lives in Brooklyn with his wife and cat.
Table of Contents
The Fundamentals of Machine Learning
Simple linear regression
Classification and Regression with K Nearest Neighbors
Feature Extraction and Preprocessing
From Simple Regression to Multiple Regression
From Linear Regression to Logistic Regression
Naive Bayes
Nonlinear Classification and Regression with Decision Trees
From Decision Trees to Random Forests, and other Ensemble Methods
The Perceptron
From the Perceptron to Support Vector Machines
From the Perceptron to Artificial Neural Networks
Clustering with K-Means
Dimensionality Reduction with Principal Component Analysis