Data Engineering with AWS
eBook - ePub

Data Engineering with AWS

A practical guide to building scalable and secure enterprise data platforms (English Edition)

  1. English
  2. ePUB (mobile friendly)
  3. Available on iOS & Android
eBook - ePub

Data Engineering with AWS

A practical guide to building scalable and secure enterprise data platforms (English Edition)

About this book

Description
Data engineering and AWS form the backbone of modern enterprise data architecture, enabling organizations to harness the exponential growth of data for competitive advantage. As businesses generate petabytes of information daily, the ability to build scalable, secure, and cost-effective data platforms has become critical for survival in today's data-driven economy.

This comprehensive guide takes you through the complete journey of building enterprise-grade data platforms on AWS. You will understand data lake foundations with S3, implement real-time streaming with Kinesis, and optimize batch processing using Glue. The book covers advanced topics, including data warehouse engineering with Redshift, modern architectural patterns like data mesh, and cross-boundary data sharing strategies. The guide explores the GenAI revolution transforming data platforms from human-centric to AI-native systems, covering enhanced medallion architectures that serve both traditional analytics and generative AI workloads.

By the end of this book, you will be able to design and build scalable, secure, and cost-effective data platforms on AWS. You will master the skills to process massive datasets, implement enterprise-grade security, and architect solutions for real-time analytics and ML workflows, ultimately driving significant business value.

What you will learn
? Build petabyte-scale data lakes using S3 and Lake Formation.
? Implement real-time streaming pipelines with Kinesis and Lambda.
? Design cost-optimized data warehouses using Amazon Redshift.
? Create modern data mesh architectures on AWS.
? Master DataOps practices with CI/CD and IaC.
? Architect GenAI-native platforms with enhanced medallion architectures.
? Integrate ML pipelines using SageMaker and Glue.
? Implement enterprise security and governance strategies.

Who this book is for
This book is ideal for data engineers, cloud architects, DevOps engineers, and solutions architects building data platforms on AWS. Data scientists, ML engineers, and technical managers seeking to understand modern data infrastructure implementation will also find immense value.

Table of Contents
1. Modern Data Engineering Landscape
2. Building Data Lake Foundations
3. Data Formats and Storage Optimization
4. Real-time Data Ingestion and Streaming
5. Batch Data Processing
6. Data Transformation and Quality
7. Data Warehouse Engineering with Redshift
8. Modern Data Architecture Patterns
9. Data Governance and Security
10. Cross-boundary Data Sharing and Collaborations
11. Analytics and Visualization
12. Machine Learning Integration
13. DataOps and Automation
14. GenAI Revolution in Data Engineering
15. Future-Proofing Data Platforms
Appendix: Performance Tuning Guide

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription.
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
Perlego offers two plans: Essential and Complete
  • Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
  • Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Both plans are available with monthly, semester, or annual billing cycles.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes! You can use the Perlego app on both iOS or Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Yes, you can access Data Engineering with AWS by Sanjiv Kumar Jha in PDF and/or ePUB format, as well as other popular books in Computer Science & Data Warehousing. We have over one million books available in our catalogue for you to explore.

Information

Table of contents

  1. Cover
  2. Title Page
  3. Copyright Page
  4. Dedication Page
  5. About the Author
  6. About the Reviewers
  7. Acknowledgement
  8. Preface
  9. Table of Contents
  10. 1. Modern Data Engineering Landscape
  11. 2. Building Data Lake Foundations
  12. 3. Data Formats and Storage Optimization
  13. 4. Real-time Data Ingestion and Streaming
  14. 5. Batch Data Processing
  15. 6. Data Transformation and Quality
  16. 7. Data Warehouse Engineering with Redshift
  17. 8. Modern Data Architecture Patterns
  18. 9. Data Governance and Security
  19. 10. Cross-boundary Data Sharing and Collaborations
  20. 11. Analytics and Visualization
  21. 12. Machine Learning Integration
  22. 13. DataOps and Automation
  23. 14. GenAI Revolution in Data Engineering
  24. 15. Future-Proofing Data Platforms
  25. Appendix: Performance Tuning Guide
  26. Index