Python Data Visualization Essentials Guide
eBook - ePub

Python Data Visualization Essentials Guide

Become a Data Visualization expert by building strong proficiency in Pandas, Matplotlib, Seaborn, Plotly, Numpy, and Bokeh

Kalilur Rahman

Share book
  1. English
  2. ePUB (mobile friendly)
  3. Available on iOS & Android
eBook - ePub

Python Data Visualization Essentials Guide

Become a Data Visualization expert by building strong proficiency in Pandas, Matplotlib, Seaborn, Plotly, Numpy, and Bokeh

Kalilur Rahman

Book details
Book preview
Table of contents
Citations

About This Book

Build your data science skills. Start data visualization Using Python. Right away. Become a good data analyst by creating quality data visualizations using Python.

Key Features
? Exciting coverage on loads of Python libraries, including Matplotlib, Seaborn, Pandas, and Plotly.
? Tons of examples, illustrations, and use-cases to demonstrate visual storytelling of varied datasets.
? Covers a strong fundamental understanding of exploratory data analysis (EDA), statistical modeling, and data mining.

Description
Data visualization plays a major role in solving data science challenges with various capabilities it offers. This book aims to equip you with a sound knowledge of Python in conjunction with the concepts you need to master to succeed as a data visualization expert.The book starts with a brief introduction to the world of data visualization and talks about why it is important, the history of visualization, and the capabilities it offers. You will learn how to do simple Python-based visualization with examples with progressive complexity of key features. The book starts with Matplotlib and explores the power of data visualization with over 50 examples. It then explores the power of data visualization using one of the popular exploratory data analysis-oriented libraries, Pandas.The book talks about statistically inclined data visualization libraries such as Seaborn. The book also teaches how we can leverage bokeh and Plotly for interactive data visualization. Each chapter is enriched and loaded with 30+ examples that will guide you in learning everything about data visualization and storytelling of mixed datasets.

What you will learn
? Learn to work with popular Python libraries and frameworks, including Seaborn, Bokeh, and Plotly.
? Practice your data visualization understanding across numerous datasets and real examples.
? Learn to visualize geospatial and time-series datasets.
? Perform correlation and EDA analysis using Pandas and Matplotlib.

Who this book is for
This book is for all data analytics professionals, data scientists, and data mining hobbyists who want to be strong data visualizers by learning all the popular Python data visualization libraries. Prior working knowledge of Python is assumed.

Table of Contents
1. Introduction to Data Visualization
2. Why Data Visualization
3. Various Data Visualization Elements and Tools
4. Using Matplotlib with Python
5. Using NumPy and Pandas for Plotting
6. Using Seaborn for Visualization
7. Using Bokeh with Python
8. Using Plotly, Folium, and Other Tools for Data Visualization
9. Hands-on Examples and Exercises, Case Studies, and Further Resources

About the Authors
Kallur Rahman is an IT industry leader with over 2 decades of experience in software development, testing, program/ project management, and management consultancy. He has been a developer, designer, technical architect, test program manager, delivery unit head, IT Services, and COE/Factory Services leader of various complexity spanning telecommunications, Life Sciences, Retail, and Healthcare Industries. He has a master's degree in Business Administration preceded by an Engineering degree in Computer Science. He has counseled CxO level executives in market-leading corporations for testing, business and technology transformation programs. As a thought-leader, he is a frequent invitee at several industry events spanning technical and domain-focused themes.
He is a believer in "Knowledge is Power" and is passionate about authoring and sharing his knowledge. He has published over 200 articles across LinkedIn, DevOps.Com, and other leading magazines.He is additionally an active quizzing aficionado who engages and contributes at corporate level quizzing. LinkedIn Bio: https://www.linkedin.com/in/kalilurrahman/

Frequently asked questions

How do I cancel my subscription?
Simply head over to the account section in settings and click on “Cancel Subscription” - it’s as simple as that. After you cancel, your membership will stay active for the remainder of the time you’ve paid for. Learn more here.
Can/how do I download books?
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
What is the difference between the pricing plans?
Both plans give you full access to the library and all of Perlego’s features. The only differences are the price and subscription period: With the annual plan you’ll save around 30% compared to 12 months on the monthly plan.
What is Perlego?
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Do you support text-to-speech?
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Is Python Data Visualization Essentials Guide an online PDF/ePUB?
Yes, you can access Python Data Visualization Essentials Guide by Kalilur Rahman in PDF and/or ePUB format, as well as other popular books in Computer Science & Data Visualisation. We have over one million books available in our catalogue for you to explore.

Information

Year
2021
ISBN
9789391030063

CHAPTER 1

Introduction to Data Visualization

"Human visual perception is grounded on a single set of biological and psychological principles, regardless of culture. Cultural differences should be taken into account but be aware that there are common foundations for what we do."
– Alberto Cairo
Data Visualization and data storytelling have taken the world by storm. Visualization skills are one of the hot skills in the market. Like the introduction, the focus of this book is to give an introduction to data visualization using Python as a primary tool of choice. Specifically, coming to the objective of this chapter, the idea is to introduce data visualization and the difference between various aspects of analysis and the importance of data visualization to the world of business analysis, product management, and its correlation. The importance of data visualization is so big that it determines the winners and losers. Some of the leading technology companies are successful due to their ability to derive insights into their customers' data, products, and services. Some of the large companies with a goldmine are data are not as successful as they should be due to the lack of ability to generate actionable insights.

Structure

In this chapter, we will cover the following topics:
  • What is data visualization
  • Key elements of data visualization
  • Importance of data visualization

Objective

This chapter aims at giving a good amount of introduction to what data visualization is about, why data visualization is important, how it evolved over time, and its key elements. This chapter will set the context for approaching data visualization from atechnology solution point of view using Python later in the chapters.
Data visualization is a visual art of storytelling with an intent to share insights with a meaningful purpose. Data visualization leverages graphical elements such as graphs, charts, maps, and other elements to produce a meaningful graphical representation of data and information. It is a powerful way to share insights on trends, patterns, and outliers in a set of data. The users can analyze the patterns and gain an insight into the data shared. It can be said that visualization is an art due to the creative aspects involved. Data visualization is a very powerful concept in today's world and an important skill to imbibe to succeed.

What is data visualization?

Human brains are trained to spot patterns through our experiential learning throughout life. There is a popular adage - A picture is worth a thousand words. Our eyes are attracted to patterns and colors. In combination with the brain's cognitive abilities, we are attracted by visuals that make an impact. It could be an image, scenery, or an image in a movie or a TV advertisement.
Visualization is a process that transforms the representation of real data from something into something meaningful in a visual representation. The key is meaningful rendering in a simple visualization, even for complex data. Similar to the quote, a picture is worth a thousand words, a good visualization tells a story simply and efficiently, like a good painting. It is visual art that can be used as a powerful storytelling tool.
In literature, I can't say that story A is better than poem B; I have to compare stories with stories and poems with poems, despite being all literature. The same applies to data visualization.
- Jorge Camões
Data visualization, in a metaphorical way, is one way to leverage the visual art of storytelling. Visualization is an intent to share insights with a meaningful purpose. Data visualization leverages graphical elements such as graphs, charts, maps, and other elements to produce a meaningful graphical representation of data and information. It is a powerful way to share insights on trends, patterns, and outliers in a set of data. The users can analyze the patterns and gain an insight into the data shared. It can be said that visualization is an art due to the creative aspects involved.
Data visualization is both an art (visuals) and science (the method of rendering the data) combined. Data visualization can be leveraged to display insights of both quantitative and qualitative data under analysis1.
The scientific part of data visualization is done using software or libraries available for rendering graphical visualization. This book is primarily dedicated to this aspect, focusing on a particular language we've chosen – Python.
The French cave paintings, at Chauvet-Pont-d'Arc Cave in France, albeit an art form, showcase some inspiring elements of visualization. Some of the themes used in data visualization were, in a way, let's say, inspired by some of the visual art forms that followed for years. It progressed further with abbreviations used for tax notifications by governments to simple graphs of a line graph or a bar chart before graduating to mind-boggling real-time interactive visualization.

Brilliant use of data visualization in history

Some of the greatest visualization examples include the following:
  1. Visualization of the cholera deaths by John Snow, known as the father of epidemiology, is a study of disease and patterns to identify measures to solve the issues. He visualized the cholera deaths for a London borough. While visualizing the outbreak in the city, he noted that the number of deaths at a particular street (Broad Street) near a water pump was high. This led to an insight that cholera was caused by germ-contaminated water than particles in the air. This changed the course of medicine and treatments for outbreaks.
  2. The brilliant use of wonderful data visualization by Florence Nightingale to record the causes of mortality during the Crimean war. Her fact-oriented visualizations proved that more soldiers died due to infections than that of actual fatality in the war. Her visualizations proved the power of inference of data. Data visualizations thrive on the power of insights and inference, and Nightingale's visualization brought the idea of a single picture being more powerful than thousand words to the fore. Florence Nightingale also produced other data visualization charts to prove a point to the government, healthcare professionals, and the public that sanitation is key for healthy lives. Florence Nightingale, also known as "The Lady with the Lamp," was a pioneering icon in statistics and data visualization.
Data visualization has transformed many organizations to become wildly successful and has helped governments make decisions to improve the lives of the citizens. Some of the data used are sales and profit numbers, market coverage, employee productivity, etc. It could be budget and revenue figures, health indicators for citizens, employment data, and education data to make policy decisions for governments. For humans, one major use case of impactful contribution by data visualization is the efficient usage to expand the average lifespan.
  • By helping the healthcare professionals to do the right type of diagnosis and analyzing to understand the patterns and outliers
  • By focusing on statistically important aspects to build procedures, discover and develop medicines, and choose treatment
  • By giving an insight into the trends, progress, and to make an informed decision for the betterment of business
Data visualization is a powerful way to tell a visual story that can help determine outliers, patterns, trends, and correlations of data available and make meaningful decisions.

Key elements of data visualization

In the 21st century, data visualization has picked a lot of momentum with the advent of increased use of artificial intelligence and data science. The use of data visualization for research and development, education, and commercial usage have expanded exponentially using interactive dashboards, infographics, and other data rendering tools. It is used in every aspect of our daily life. It is a lot easier to generate powerful visual stories, and I see at least 5 to 10 data visualization elements daily.
One of the keys to the success of the popularity is the evolution of data visualization as an art and science discipline. If you take the examples of visualizations by Florence Nightingale and John Snow, both took a considerable amount of manual analysis and attention to details and great application of key data visualization elements to make them ground-breaking. There are many definitions of key elements that vary from designer-to-designer and author-to-author. From a simplicity standpoint, let us see some of the key elements of data visualization.

Elements of data visualization

There are plenty of guides available covering the key elements and themes to be considered for effective data visualization. We shall cover some of the essential elements to focus on and consider while designing data visualization. Let us see the key elements for data visualization in a diagram. We can call this DUSSSS – Data, User, Strategy, Structure, Style, and Story.
Figure 1.1: Elements of data visualization
Figure 1.1 shows the key elements in a simple visualization in the form of a pictorial. We shall cover each of the elements in detail. We shall explore each element in a question-and-answer format. At a high level, the six key elements to focus on are:
  • Strategy: What is your data visualization strategy?
  • Structure: How are you planning to structure your story?
  • Data: What type of data are you planning to use? How many datasets are you planning to use?
  • Style: A key element on your visualization style, choice of visualization elements such as graphs and charts, choice of colors and other visual elements, use of qualitative and quantitative information to convey a message
  • User: The key to the success of the data visualization exercise. Who are your users? Why should they be using your product? What is the key takeaway for them?
  • Story: Most important aspect of the exercise. What are you trying to convey, what would the key insights, messages, actions, inspirations they can take away to implement actions?
Let us delve into a bit more detail on these elements.

Strategy

Having a good strategy for your visualization exercise is very important. This is because the visualization outcome is purely based on the data being used. Like bad content or a theme could derail a movie or an advertisement, bad data can result in poor outcomes in story, elegance, insights, etc. Having a good strategy is important for a visualization exercise. This includes data strategy, design strategy in terms of user persona and visual elements, etc.
  • Having good data capturing, data extracting, data cleansing, data integration strategy is very important. This strategy is especially important for planning interactive, real-time, update-oriented dashboards and data visualization. There should be a data strategy for the visualization exercise.
  • Another element to consider is the user experience and design thinking strategy to address the needs and wants of the users. Using a persona-based design of visual elements can help in designing better visualization elements.
    • As one size fits all does not exist, a designer bias can be avoided by taking the user requirements, and user needs into consideration through the empathy-based user-centric design of elements.
    • Design elements, style elements, visual themes, templates, messaging, colors, form factors, devices, and gesture-based themes and actions can all be thought in advance.
    • Having a clear structure, simplicity, better visibility, and consistency in design could be thought through before the design is done.

Story

Data visualization can be used for two purposes when it...

Table of contents