Textbook in PDF format
This book introduces an assortment of command-line utilities that can be combined to create simple yet powerful shell scripts for processing datasets. The code samples and scripts use the bash shell and typically involve small datasets, so you can focus on understanding the features of grep, sed, and awk. Companion files with code are available for download from the publisher.
Features
Provides the reader with command-line utilities that can be combined to create simple yet powerful shell scripts for processing datasets.
Contains a variety of code fragments and shell scripts for data scientists, data analysts, and those who want shell-based solutions to “clean” various types of datasets.
Companion files with code are available for download by writing to the publisher with Amazon proof of purchase.
Introduction to UNIX.
Files and Directories.
Useful Commands.
Conditional Logic and Loops.
Processing Datasets with grep and sed.
Processing Datasets with awk.
Processing Datasets (Pandas).
NoSQL, SQLite, and Python.