Data Wrangling with Python
eBook - ePub

Data Wrangling with Python

Creating actionable data from raw sources

Dr. Tirthajyoti Sarkar, Shubhadeep Roychowdhury

Share book
  1. 452 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

Data Wrangling with Python

Creating actionable data from raw sources

Dr. Tirthajyoti Sarkar, Shubhadeep Roychowdhury

Book details
Table of contents
Citations

About This Book

Simplify your ETL processes with these hands-on data hygiene tips, tricks, and best practices.

Key Features

  • Focus on the basics of data wrangling
  • Study various ways to extract the most out of your data in less time
  • Boost your learning curve with bonus topics like random data generation and data integrity checks

Book Description

For data to be useful and meaningful, it must be curated and refined. Data Wrangling with Python teaches you the core ideas behind these processes and equips you with knowledge of the most popular tools and techniques in the domain.

The book starts with the absolute basics of Python, focusing mainly on data structures. It then delves into the fundamental tools of data wrangling like NumPy and Pandas libraries. You'll explore useful insights into why you should stay away from traditional ways of data cleaning, as done in other languages, and take advantage of the specialized pre-built routines in Python. This combination of Python tips and tricks will also demonstrate how to use the same Python backend and extract/transform data from an array of sources including the Internet, large database vaults, and Excel financial tables. To help you prepare for more challenging scenarios, you'll cover how to handle missing or wrong data, and reformat it based on the requirements from the downstream analytics tool. The book will further help you grasp concepts through real-world examples and datasets.

By the end of this book, you will be confident in using a diverse array of sources to extract, clean, transform, and format your data efficiently.

What you will learn

  • Use and manipulate complex and simple data structures
  • Harness the full potential of DataFrames and numpy.array at run time
  • Perform web scraping with BeautifulSoup4 and html5lib
  • Execute advanced string search and manipulation with RegEX
  • Handle outliers and perform data imputation with Pandas
  • Use descriptive statistics and plotting techniques
  • Practice data wrangling and modeling using data generation techniques

Who this book is for

Data Wrangling with Python is designed for developers, data analysts, and business analysts who are keen to pursue a career as a full-fledged data scientist or analytics expert. Although, this book is for beginners, prior working knowledge of Python is necessary to easily grasp the concepts covered here. It will also help to have rudimentary knowledge of relational database and SQL.

Frequently asked questions

How do I cancel my subscription?
Simply head over to the account section in settings and click on “Cancel Subscription” - it’s as simple as that. After you cancel, your membership will stay active for the remainder of the time you’ve paid for. Learn more here.
Can/how do I download books?
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
What is the difference between the pricing plans?
Both plans give you full access to the library and all of Perlego’s features. The only differences are the price and subscription period: With the annual plan you’ll save around 30% compared to 12 months on the monthly plan.
What is Perlego?
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Do you support text-to-speech?
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Is Data Wrangling with Python an online PDF/ePUB?
Yes, you can access Data Wrangling with Python by Dr. Tirthajyoti Sarkar, Shubhadeep Roychowdhury in PDF and/or ePUB format, as well as other popular books in Ciencia de la computación & Programación en Python. We have over one million books available in our catalogue for you to explore.

Information

Year
2019
ISBN
9781789804249

Table of contents