Bash for Data Scientists
by Campesato, Oswald;
English | 2022 | ISBN: 168392973X | 293 pages | True PDF | 5.87 MB
This book introduces an assortment of powerful command line utilities
that can be combined to create simple, yet powerful shell scripts for processing datasets.
The code samples and scripts use the bash shell, and typically involve small datasets so
you can focus on understanding the features of grep, sed, and awk. Companion files
with code are available for downloading from the publisher.
Features
+Provides the reader with powerful command line utilities that can be combined to
create simple yet powerful shell scripts for processing datasets
+Contains a variety of code fragments and shell scripts for data scientists, data analysts,
and those who want shell-based solutions to "clean" various types of datasets
+Companion files with code available for downloading with Amazon proof of
purchase by writing to the publisher.
Table of Contents
1: Introduction to UNIX. 2: Files and Directories. 3: Useful Commands.
4: Conditional Logic and Loops. 5: Processing Datasets with grep and sed.
6: Processing Datasets with awk. 7: Processing Datasets (Pandas).
8: NoSQL, SQLite, and Python. Index.
About the Author
Oswald Campesato (San Francisco, CA) is an adjunct instructor
at UC-Santa Clara and specializes in Deep Learning, Java, Android,
and NLP. He is the author of over twenty-five books including the
SQL Pocket Primer, Python 3 for Machine Learning, and the
NLP Using R Pocket Primer (all Mercury Learning).
Links are Interchangeable - No Password - Single Extraction