
Big Data with Hadoop MapReduce
A Classroom Approach
- 406 pages
- English
- ePUB (mobile friendly)
- Available on iOS & Android
Big Data with Hadoop MapReduce
A Classroom Approach
About this book
The authors provide an understanding of big data and MapReduce by clearly presenting the basic terminologies and concepts. They have employed over 100 illustrations and many worked-out examples to convey the concepts and methods used in big data, the inner workings of MapReduce, and single node/multi-node installation on physical/virtual machines. This book covers almost all the necessary information on Hadoop MapReduce for most online certification exams. Upon completing this book, readers will find it easy to understand other big data processing tools such as Spark, Storm, etc.
Ultimately, readers will be able to:
• understand what big data is and the factors that are involved
• understand the inner workings of MapReduce, which is essential for certification exams
• learn the features and weaknesses of MapReduce
• set up Hadoop clusters with 100s of physical/virtual machines
• create a virtual machine in AWS
• write MapReduce with Eclipse in a simple way
• understand other big data processing tools and their applications
Frequently asked questions
- Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
- Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Information
CHAPTER 1
Big Data
INTRODUCTION
1.1 BIG DATA
Some interesting facts on big data
1.1.1 BIG DATA SOURCES
before 1980 – devices were generating data.1980–2000 – employees generated data as an end user.since 2000 – people started contributing data via social applications, e-mails, etc.after 2005 – every hardware, software, application generated log data.
Table of contents
- Cover
- Half Title
- Title Page
- Copyright Page
- About the Authors
- A Message from Kaniyan
- Table of Contents
- Abbreviations
- Preface
- Dedication and Acknowledgment
- Introduction
- 1. Big Data
- 2. Hadoop Framework
- 3. Hadoop 1.2.1 Installation
- 4. Hadoop Ecosystem
- 5. Hadoop 2.7.0
- 6. Hadoop 2.7.0 Installation
- 7. Data Science
- APPENDIX A: Public Datasets
- APPENDIX B: MapReduce Exercise
- APPENDIX C: Case Study: Application Development for NYSE Dataset
- Web References
- Index