Big Data Analytics
eBook - ePub

Big Data Analytics

Tools and Technology for Effective Planning

Arun K. Somani, Ganesh Chandra Deka, Arun K. Somani, Ganesh Chandra Deka

  1. 399 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

Big Data Analytics

Tools and Technology for Effective Planning

Arun K. Somani, Ganesh Chandra Deka, Arun K. Somani, Ganesh Chandra Deka

Book details
Book preview
Table of contents
Citations

About This Book

The proposed book will discuss various aspects of big data Analytics. It will deliberate upon the tools, technology, applications, use cases and research directions in the field. Chapters would be contributed by researchers, scientist and practitioners from various reputed universities and organizations for the benefit of readers.

Frequently asked questions

How do I cancel my subscription?
Simply head over to the account section in settings and click on “Cancel Subscription” - it’s as simple as that. After you cancel, your membership will stay active for the remainder of the time you’ve paid for. Learn more here.
Can/how do I download books?
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
What is the difference between the pricing plans?
Both plans give you full access to the library and all of Perlego’s features. The only differences are the price and subscription period: With the annual plan you’ll save around 30% compared to 12 months on the monthly plan.
What is Perlego?
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Do you support text-to-speech?
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Is Big Data Analytics an online PDF/ePUB?
Yes, you can access Big Data Analytics by Arun K. Somani, Ganesh Chandra Deka, Arun K. Somani, Ganesh Chandra Deka in PDF and/or ePUB format, as well as other popular books in Informatique & Bases de données. We have over one million books available in our catalogue for you to explore.

Information

Year
2017
ISBN
9781315391243
Edition
1

1Challenges in Big Data

Pothireddy Venkata Lakshmi Narayana Rao, Pothireddy Siva Abhilash, and PS Pavan Kumar
Introduction
Background
Goals and Challenges of Analyzing Big Data
Paradigm Shifts
Organization of This Paper
Algorithms for Big Data Analytics
k-Means
Classification Algorithms: k-NN
Application of Big Data: A Case Study
Economics and Finance
Other Applications
Salient Features of Big Data
Heterogeneity
Noise Accumulation
Spurious Correlation
Coincidental Endogeneity
Impact on Statistical Thinking
Independence Screening
Dealing with Incidental Endogeneity
Impact on Computing Infrastructure
Literature Review
MapReduce
Cloud Computing
Impact on Computational Methods
First-Order Methods for Non-Smooth Optimization
Dimension Reduction and Random Projection
Future Perspectives and Conclusion
Existing Methods
Proposed Methods
Probabilistic Graphical Modeling
Mining Twitter Data: From Content to Connections
Late Work: Location-Specific Tweet Detection and Topic Summarization in Twitter
Tending to Big Data Challenges in Genome Sequencing and RNA Interaction Prediction
Single-Cell Genome Sequencing
RNA Structure and RNA–RNA Association Expectation
Identifying Qualitative Changes in Living Systems
Acknowledgments
References
Additional References for Researchers and Advanced Readers for Further Reading
Key Terminology and Definitions

Introduction

Enormous data guarantee new levels of investigative disclosure and financial quality. What is new about Big Data and how they vary from the conventional little or medium-scale information? This paper outlines the open doors and difficulties brought by Big Data, with accentuation on the recognized elements of Big Data and measurable and computational technique and in addition registering engineering to manage them.

Background

We are entering the time of Big Data, a term that alludes to the blast of data now accessible. Such a Big Data development is driven by the way that gigantic measures of high-dimensional or unstructured information are consistently delivered and are presented in a much less “luxurious” format than they used to be. For instance, in genomics we have seen an enormous drop in costs for sequencing of an entire genome [1]. This is likewise valid in many different scientific areas, for example, online network examination, biomedical imaging, high-recurrence money transactions, investigation of reconnaissance recordings, and retail deals. The current pattern for these vast amounts of information to be delivered and stored in an inexpensive manner is likely to keep up or even quicken in the future [2]. This pattern will have a profound effect on science, designing, and business. For instance, logical advances are turning out to be increasingly information driven, and specialists will increasingly consider themselves customers of information. The monstrous measures of high-dimensional information convey both open doors and new difficulties to information examination. Substantial measurable investigations for Big Data handling are turning out to be progressively essential.

Goals and Challenges of Analyzing Big Data

What are the purposes of violation depressed Big Data? As per Fan and Lu [3], two principal objectives of high-dimensional information investigation are to create powerful strategies that can precisely anticipate the future perceptions and in the meantime gain understanding into the relationship between the elements and reactions for experimental purposes. In addition, because of the extensive specimen size, Big Data offers an ascent to two more objectives: to comprehend heterogeneity and shared traits across various subpopulations.
At the end of the day, Big Data gives guarantees for:
  1. Investigating the shrouded structures of every subpopulation of the information, which is generally not possible and may even be dealt with as “exceptions” when the specimen size is small; and
  2. Extricating imperative regular elements across numerous subpopulations notwithstanding the expansive individual varieties of data.
What are the difficulties of investigating Big Data? Big Data is portrayed by high dimensionality and substantial specimen size. These two elements raise three one-of-a-kind difficulties:
  1. High dimensionality brings clamor gathering, spurious relationships, and coincidental homogeneity;
  2. High dimensionality consolidated with vast specimen size brings additional considerations, for example, regarding substantial computational expense and algorithmic flimsiness;
  3. The gigantic examples in Big Data are regularly totaled from various sources at various times, utilizing distinctive advances. This creates issues regarding heterogeneity, trial varieties, and factual predispositions and obliges us to employ more versatile and hardy methodologies.

Paradigm Shifts

To handle the troubles of Big Data, we require new quantifiable derivation and computational techniques. As an example, various standard systems that perform well for moderate test sizes don’t scale to enormous amounts of data. Basically, various truthful methodologies that perform well for low-dimensional data are going up against basic troubles in separating high-dimensional data. To plot effective, truthful strategies for exploring and anticipating Big Data, we need to address Big Data issues, for instance, heterogeneity, hullabaloo gathering, spurious connections, and fortuitous endogeneity, despite changing the quantifiable precision and computational profitability.
With respect to exactness, estimation diminishment, and variable determination are critical parts in exploring high-dimensional data. We will address these disturbing, building issues. As a case in point, in a high-dimensional portrayal, Fan and Fan [4] and Pittelkow and Ghosh [5] reported ...

Table of contents

Citation styles for Big Data Analytics

APA 6 Citation

[author missing]. (2017). Big Data Analytics (1st ed.). CRC Press. Retrieved from https://www.perlego.com/book/1498218/big-data-analytics-tools-and-technology-for-effective-planning-pdf (Original work published 2017)

Chicago Citation

[author missing]. (2017) 2017. Big Data Analytics. 1st ed. CRC Press. https://www.perlego.com/book/1498218/big-data-analytics-tools-and-technology-for-effective-planning-pdf.

Harvard Citation

[author missing] (2017) Big Data Analytics. 1st edn. CRC Press. Available at: https://www.perlego.com/book/1498218/big-data-analytics-tools-and-technology-for-effective-planning-pdf (Accessed: 14 October 2022).

MLA 7 Citation

[author missing]. Big Data Analytics. 1st ed. CRC Press, 2017. Web. 14 Oct. 2022.