Book Description

The updated version of this book is available at https://dunderdata.com/master-data-analysis-with-python. Please purchase it there.

Master Data Analysis with Python is a comprehensive text dedicated to teaching the fundamentals of data analysis and visualization. It is divided into 10 parts:

  • Environment Setup and Jupyter Notebooks
  • Intro to pandas
  • Selecting Subsets of Data
  • Essential Commands
  • Grouping Data
  • Time Series
  • Regular Expressions
  • Tidy Data
  • Joining Data
  • Visualization


This book provides precise and modern approaches to doing data analysis with Python. To help you master the concepts, over 300 exercises with detailed solutions are available. There are also projects available that give you a chance to bring together multiple concepts and tools as you would in a real-life analysis.

Master Data Analysis with Python is written by Ted Petrou, author of the highly rated text Pandas Cookbook. This book is much better than Pandas Cookbook and covers substantially more topics with better explanations. Ted has spent the last 3 years teaching data science in-person using Python to hundreds of students and sees first hand exactly where students struggle. He has continually upgraded his material to minimize these struggles by providing simple and direct paths forward.

The goal for Master Data Analysis is to be the absolute best possible text to learn how to use the data analysis libraries available in Python to explore and understand data.

The book will receive continuous updates that you will have access to through at least 2020.

The primary Python library used during the book is pandas. Visualization will be handled by the matplotlib and seaborn libraries, both of which are covered in great detail. The best practices from the very latest versions of the libraries are used. 

*This book assumes you already have a solid understanding of the basics of Python. If you do not, you should master these fundamentals first.  Master the Fundamentals of Python also by Ted Petrou, provides the necessary prerequisite knowledge.

This book assumes no knowledge of any of the Python data science libraries. Each part progresses slowly beginning with the basics and ending with more advanced topics in the later chapters.

All the material is contained within both within Jupyter Notebooks and as a PDF. This allows you to open a notebook, read through the material, run the code, and then answer the exercises all within a single environment.

Master Machine Learning in Python is Ted's next book and will have a first draft released by the fall of 2019.