Data Science for Marketing Analytics
Achieve your marketing goals with the data analytics power of Python
Tommy Blanchard, Debasish Behera, Pranshu Bhatnagar
- 420 pages
- English
- ePUB (mobile friendly)
- Available on iOS & Android
Data Science for Marketing Analytics
Achieve your marketing goals with the data analytics power of Python
Tommy Blanchard, Debasish Behera, Pranshu Bhatnagar
About This Book
Explore new and more sophisticated tools that reduce your marketing analytics efforts and give you precise results
Key Features
- Study new techniques for marketing analytics
- Explore uses of machine learning to power your marketing analyses
- Work through each stage of data analytics with the help of multiple examples and exercises
Book Description
Data Science for Marketing Analytics covers every stage of data analytics, from working with a raw dataset to segmenting a population and modeling different parts of the population based on the segments.
The book starts by teaching you how to use Python libraries, such as pandas and Matplotlib, to read data from Python, manipulate it, and create plots, using both categorical and continuous variables. Then, you'll learn how to segment a population into groups and use different clustering techniques to evaluate customer segmentation. As you make your way through the chapters, you'll explore ways to evaluate and select the best segmentation approach, and go on to create a linear regression model on customer value data to predict lifetime value. In the concluding chapters, you'll gain an understanding of regression techniques and tools for evaluating regression models, and explore ways to predict customer choice using classification algorithms. Finally, you'll apply these techniques to create a churn model for modeling customer product choices.
By the end of this book, you will be able to build your own marketing reporting and interactive dashboard solutions.
What you will learn
- Analyze and visualize data in Python using pandas and Matplotlib
- Study clustering techniques, such as hierarchical and k-means clustering
- Create customer segments based on manipulated data
- Predict customer lifetime value using linear regression
- Use classification algorithms to understand customer choice
- Optimize classification algorithms to extract maximal information
Who this book is for
Data Science for Marketing Analytics is designed for developers and marketing analysts looking to use new, more sophisticated tools in their marketing analytics efforts. It'll help if you have prior experience of coding in Python and knowledge of high school level mathematics. Some experience with databases, Excel, statistics, or Tableau is useful but not necessary.
Frequently asked questions
Information
Chapter 1
Data Preparation and Cleaning
Learning Objectives
- Create pandas DataFrames in Python
- Read and write data into different file formats
- Slice, aggregate, filter, and apply functions (built-in and custom) to DataFrames
- Join DataFrames, handle missing values, and combine different data sources
Introduction
Data Models and Structured Data
- Structured Data: This is also known as completely structured or well-structured data. This is the simplest way to manage information. The data is arranged in a flat tabular form with the correct value corresponding to the correct attribute. There is a unique column, known as an index, for easy and quick access to the data, and there are no duplicate columns. Data can be queried exactly through SQL queries, for example, data in relational databases, MySQL, Amazon Redshift, and so on.
- Semi-structured data: This refers to data that may be of variable lengths and that may contain different data types (such as numerical or categorical) in the same column. Such data may be arranged in a nested or hierarchical tabular structure, but it still follows a fixed schema. There are no duplicate columns (attributes), but there may be duplicate rows (observations). Also, each row might not contain values for every attribute, that is, there may be missing values. Semi-structured data can be stored accurately in NoSQL databases, Apache Parquet files, JSON files, and so on.
- Unstructured data: Data that is unstructured may not be tabular, and even if it is tabular, the number of attributes or columns per observation may be completely arbitrary. The same data could be represented in different ways, and the attributes might not match each other, with values leaking into other parts. Unstructured data can be stored as text files, CSV files,...