Python Data Visualization Essentials Guide
eBook - ePub

Python Data Visualization Essentials Guide

Become a Data Visualization expert by building strong proficiency in Pandas, Matplotlib, Seaborn, Plotly, Numpy, and Bokeh

Kalilur Rahman

Condividi libro
  1. English
  2. ePUB (disponibile sull'app)
  3. Disponibile su iOS e Android
eBook - ePub

Python Data Visualization Essentials Guide

Become a Data Visualization expert by building strong proficiency in Pandas, Matplotlib, Seaborn, Plotly, Numpy, and Bokeh

Kalilur Rahman

Dettagli del libro
Anteprima del libro
Indice dei contenuti
Citazioni

Informazioni sul libro

Build your data science skills. Start data visualization Using Python. Right away. Become a good data analyst by creating quality data visualizations using Python.

Key Features
? Exciting coverage on loads of Python libraries, including Matplotlib, Seaborn, Pandas, and Plotly.
? Tons of examples, illustrations, and use-cases to demonstrate visual storytelling of varied datasets.
? Covers a strong fundamental understanding of exploratory data analysis (EDA), statistical modeling, and data mining.

Description
Data visualization plays a major role in solving data science challenges with various capabilities it offers. This book aims to equip you with a sound knowledge of Python in conjunction with the concepts you need to master to succeed as a data visualization expert.The book starts with a brief introduction to the world of data visualization and talks about why it is important, the history of visualization, and the capabilities it offers. You will learn how to do simple Python-based visualization with examples with progressive complexity of key features. The book starts with Matplotlib and explores the power of data visualization with over 50 examples. It then explores the power of data visualization using one of the popular exploratory data analysis-oriented libraries, Pandas.The book talks about statistically inclined data visualization libraries such as Seaborn. The book also teaches how we can leverage bokeh and Plotly for interactive data visualization. Each chapter is enriched and loaded with 30+ examples that will guide you in learning everything about data visualization and storytelling of mixed datasets.

What you will learn
? Learn to work with popular Python libraries and frameworks, including Seaborn, Bokeh, and Plotly.
? Practice your data visualization understanding across numerous datasets and real examples.
? Learn to visualize geospatial and time-series datasets.
? Perform correlation and EDA analysis using Pandas and Matplotlib.

Who this book is for
This book is for all data analytics professionals, data scientists, and data mining hobbyists who want to be strong data visualizers by learning all the popular Python data visualization libraries. Prior working knowledge of Python is assumed.

Table of Contents
1. Introduction to Data Visualization
2. Why Data Visualization
3. Various Data Visualization Elements and Tools
4. Using Matplotlib with Python
5. Using NumPy and Pandas for Plotting
6. Using Seaborn for Visualization
7. Using Bokeh with Python
8. Using Plotly, Folium, and Other Tools for Data Visualization
9. Hands-on Examples and Exercises, Case Studies, and Further Resources

About the Authors
Kallur Rahman is an IT industry leader with over 2 decades of experience in software development, testing, program/ project management, and management consultancy. He has been a developer, designer, technical architect, test program manager, delivery unit head, IT Services, and COE/Factory Services leader of various complexity spanning telecommunications, Life Sciences, Retail, and Healthcare Industries. He has a master's degree in Business Administration preceded by an Engineering degree in Computer Science. He has counseled CxO level executives in market-leading corporations for testing, business and technology transformation programs. As a thought-leader, he is a frequent invitee at several industry events spanning technical and domain-focused themes.
He is a believer in "Knowledge is Power" and is passionate about authoring and sharing his knowledge. He has published over 200 articles across LinkedIn, DevOps.Com, and other leading magazines.He is additionally an active quizzing aficionado who engages and contributes at corporate level quizzing. LinkedIn Bio: https://www.linkedin.com/in/kalilurrahman/

Domande frequenti

Come faccio ad annullare l'abbonamento?
È semplicissimo: basta accedere alla sezione Account nelle Impostazioni e cliccare su "Annulla abbonamento". Dopo la cancellazione, l'abbonamento rimarrà attivo per il periodo rimanente già pagato. Per maggiori informazioni, clicca qui
È possibile scaricare libri? Se sì, come?
Al momento è possibile scaricare tramite l'app tutti i nostri libri ePub mobile-friendly. Anche la maggior parte dei nostri PDF è scaricabile e stiamo lavorando per rendere disponibile quanto prima il download di tutti gli altri file. Per maggiori informazioni, clicca qui
Che differenza c'è tra i piani?
Entrambi i piani ti danno accesso illimitato alla libreria e a tutte le funzionalità di Perlego. Le uniche differenze sono il prezzo e il periodo di abbonamento: con il piano annuale risparmierai circa il 30% rispetto a 12 rate con quello mensile.
Cos'è Perlego?
Perlego è un servizio di abbonamento a testi accademici, che ti permette di accedere a un'intera libreria online a un prezzo inferiore rispetto a quello che pagheresti per acquistare un singolo libro al mese. Con oltre 1 milione di testi suddivisi in più di 1.000 categorie, troverai sicuramente ciò che fa per te! Per maggiori informazioni, clicca qui.
Perlego supporta la sintesi vocale?
Cerca l'icona Sintesi vocale nel prossimo libro che leggerai per verificare se è possibile riprodurre l'audio. Questo strumento permette di leggere il testo a voce alta, evidenziandolo man mano che la lettura procede. Puoi aumentare o diminuire la velocità della sintesi vocale, oppure sospendere la riproduzione. Per maggiori informazioni, clicca qui.
Python Data Visualization Essentials Guide è disponibile online in formato PDF/ePub?
Sì, puoi accedere a Python Data Visualization Essentials Guide di Kalilur Rahman in formato PDF e/o ePub, così come ad altri libri molto apprezzati nelle sezioni relative a Computer Science e Data Visualisation. Scopri oltre 1 milione di libri disponibili nel nostro catalogo.

Informazioni

Anno
2021
ISBN
9789391030063

CHAPTER 1

Introduction to Data Visualization

"Human visual perception is grounded on a single set of biological and psychological principles, regardless of culture. Cultural differences should be taken into account but be aware that there are common foundations for what we do."
– Alberto Cairo
Data Visualization and data storytelling have taken the world by storm. Visualization skills are one of the hot skills in the market. Like the introduction, the focus of this book is to give an introduction to data visualization using Python as a primary tool of choice. Specifically, coming to the objective of this chapter, the idea is to introduce data visualization and the difference between various aspects of analysis and the importance of data visualization to the world of business analysis, product management, and its correlation. The importance of data visualization is so big that it determines the winners and losers. Some of the leading technology companies are successful due to their ability to derive insights into their customers' data, products, and services. Some of the large companies with a goldmine are data are not as successful as they should be due to the lack of ability to generate actionable insights.

Structure

In this chapter, we will cover the following topics:
  • What is data visualization
  • Key elements of data visualization
  • Importance of data visualization

Objective

This chapter aims at giving a good amount of introduction to what data visualization is about, why data visualization is important, how it evolved over time, and its key elements. This chapter will set the context for approaching data visualization from atechnology solution point of view using Python later in the chapters.
Data visualization is a visual art of storytelling with an intent to share insights with a meaningful purpose. Data visualization leverages graphical elements such as graphs, charts, maps, and other elements to produce a meaningful graphical representation of data and information. It is a powerful way to share insights on trends, patterns, and outliers in a set of data. The users can analyze the patterns and gain an insight into the data shared. It can be said that visualization is an art due to the creative aspects involved. Data visualization is a very powerful concept in today's world and an important skill to imbibe to succeed.

What is data visualization?

Human brains are trained to spot patterns through our experiential learning throughout life. There is a popular adage - A picture is worth a thousand words. Our eyes are attracted to patterns and colors. In combination with the brain's cognitive abilities, we are attracted by visuals that make an impact. It could be an image, scenery, or an image in a movie or a TV advertisement.
Visualization is a process that transforms the representation of real data from something into something meaningful in a visual representation. The key is meaningful rendering in a simple visualization, even for complex data. Similar to the quote, a picture is worth a thousand words, a good visualization tells a story simply and efficiently, like a good painting. It is visual art that can be used as a powerful storytelling tool.
In literature, I can't say that story A is better than poem B; I have to compare stories with stories and poems with poems, despite being all literature. The same applies to data visualization.
- Jorge Camões
Data visualization, in a metaphorical way, is one way to leverage the visual art of storytelling. Visualization is an intent to share insights with a meaningful purpose. Data visualization leverages graphical elements such as graphs, charts, maps, and other elements to produce a meaningful graphical representation of data and information. It is a powerful way to share insights on trends, patterns, and outliers in a set of data. The users can analyze the patterns and gain an insight into the data shared. It can be said that visualization is an art due to the creative aspects involved.
Data visualization is both an art (visuals) and science (the method of rendering the data) combined. Data visualization can be leveraged to display insights of both quantitative and qualitative data under analysis1.
The scientific part of data visualization is done using software or libraries available for rendering graphical visualization. This book is primarily dedicated to this aspect, focusing on a particular language we've chosen – Python.
The French cave paintings, at Chauvet-Pont-d'Arc Cave in France, albeit an art form, showcase some inspiring elements of visualization. Some of the themes used in data visualization were, in a way, let's say, inspired by some of the visual art forms that followed for years. It progressed further with abbreviations used for tax notifications by governments to simple graphs of a line graph or a bar chart before graduating to mind-boggling real-time interactive visualization.

Brilliant use of data visualization in history

Some of the greatest visualization examples include the following:
  1. Visualization of the cholera deaths by John Snow, known as the father of epidemiology, is a study of disease and patterns to identify measures to solve the issues. He visualized the cholera deaths for a London borough. While visualizing the outbreak in the city, he noted that the number of deaths at a particular street (Broad Street) near a water pump was high. This led to an insight that cholera was caused by germ-contaminated water than particles in the air. This changed the course of medicine and treatments for outbreaks.
  2. The brilliant use of wonderful data visualization by Florence Nightingale to record the causes of mortality during the Crimean war. Her fact-oriented visualizations proved that more soldiers died due to infections than that of actual fatality in the war. Her visualizations proved the power of inference of data. Data visualizations thrive on the power of insights and inference, and Nightingale's visualization brought the idea of a single picture being more powerful than thousand words to the fore. Florence Nightingale also produced other data visualization charts to prove a point to the government, healthcare professionals, and the public that sanitation is key for healthy lives. Florence Nightingale, also known as "The Lady with the Lamp," was a pioneering icon in statistics and data visualization.
Data visualization has transformed many organizations to become wildly successful and has helped governments make decisions to improve the lives of the citizens. Some of the data used are sales and profit numbers, market coverage, employee productivity, etc. It could be budget and revenue figures, health indicators for citizens, employment data, and education data to make policy decisions for governments. For humans, one major use case of impactful contribution by data visualization is the efficient usage to expand the average lifespan.
  • By helping the healthcare professionals to do the right type of diagnosis and analyzing to understand the patterns and outliers
  • By focusing on statistically important aspects to build procedures, discover and develop medicines, and choose treatment
  • By giving an insight into the trends, progress, and to make an informed decision for the betterment of business
Data visualization is a powerful way to tell a visual story that can help determine outliers, patterns, trends, and correlations of data available and make meaningful decisions.

Key elements of data visualization

In the 21st century, data visualization has picked a lot of momentum with the advent of increased use of artificial intelligence and data science. The use of data visualization for research and development, education, and commercial usage have expanded exponentially using interactive dashboards, infographics, and other data rendering tools. It is used in every aspect of our daily life. It is a lot easier to generate powerful visual stories, and I see at least 5 to 10 data visualization elements daily.
One of the keys to the success of the popularity is the evolution of data visualization as an art and science discipline. If you take the examples of visualizations by Florence Nightingale and John Snow, both took a considerable amount of manual analysis and attention to details and great application of key data visualization elements to make them ground-breaking. There are many definitions of key elements that vary from designer-to-designer and author-to-author. From a simplicity standpoint, let us see some of the key elements of data visualization.

Elements of data visualization

There are plenty of guides available covering the key elements and themes to be considered for effective data visualization. We shall cover some of the essential elements to focus on and consider while designing data visualization. Let us see the key elements for data visualization in a diagram. We can call this DUSSSS – Data, User, Strategy, Structure, Style, and Story.
Figure 1.1: Elements of data visualization
Figure 1.1 shows the key elements in a simple visualization in the form of a pictorial. We shall cover each of the elements in detail. We shall explore each element in a question-and-answer format. At a high level, the six key elements to focus on are:
  • Strategy: What is your data visualization strategy?
  • Structure: How are you planning to structure your story?
  • Data: What type of data are you planning to use? How many datasets are you planning to use?
  • Style: A key element on your visualization style, choice of visualization elements such as graphs and charts, choice of colors and other visual elements, use of qualitative and quantitative information to convey a message
  • User: The key to the success of the data visualization exercise. Who are your users? Why should they be using your product? What is the key takeaway for them?
  • Story: Most important aspect of the exercise. What are you trying to convey, what would the key insights, messages, actions, inspirations they can take away to implement actions?
Let us delve into a bit more detail on these elements.

Strategy

Having a good strategy for your visualization exercise is very important. This is because the visualization outcome is purely based on the data being used. Like bad content or a theme could derail a movie or an advertisement, bad data can result in poor outcomes in story, elegance, insights, etc. Having a good strategy is important for a visualization exercise. This includes data strategy, design strategy in terms of user persona and visual elements, etc.
  • Having good data capturing, data extracting, data cleansing, data integration strategy is very important. This strategy is especially important for planning interactive, real-time, update-oriented dashboards and data visualization. There should be a data strategy for the visualization exercise.
  • Another element to consider is the user experience and design thinking strategy to address the needs and wants of the users. Using a persona-based design of visual elements can help in designing better visualization elements.
    • As one size fits all does not exist, a designer bias can be avoided by taking the user requirements, and user needs into consideration through the empathy-based user-centric design of elements.
    • Design elements, style elements, visual themes, templates, messaging, colors, form factors, devices, and gesture-based themes and actions can all be thought in advance.
    • Having a clear structure, simplicity, better visibility, and consistency in design could be thought through before the design is done.

Story

Data visualization can be used for two purposes when it...

Indice dei contenuti