The Essential Guide to Data Science and its Applications
Bart Baesens
This is a test
This is a test
English
ePUB (mobile friendly)
Available on iOS & Android
eBook - ePub
Analytics in a Big Data World
The Essential Guide to Data Science and its Applications
Bart Baesens
Book details
Book preview
Table of contents
Citations
About This Book
The guide to targeting and leveraging business opportunities using big data & analytics
By leveraging big data & analytics, businesses create the potential to better understand, manage, and strategically exploiting the complex dynamics of customer behavior. Analytics in a Big Data World reveals how to tap into the powerful tool of data analytics to create a strategic advantage and identify new business opportunities. Designed to be an accessible resource, this essential book does not include exhaustive coverage of all analytical techniques, instead focusing on analytics techniques that really provide added value in business environments.
The book draws on author Bart Baesens' expertise on the topics of big data, analytics and its applications in e.g. credit risk, marketing, and fraud to provide a clear roadmap for organizations that want to use data analytics to their advantage, but need a good starting point. Baesens has conducted extensive research on big data, analytics, customer relationship management, web analytics, fraud detection, and credit risk management, and uses this experience to bring clarity to a complex topic.
Includes numerous case studies on risk management, fraud detection, customer relationship management, and web analytics
Offers the results of research and the author's personal experience in banking, retail, and government
Contains an overview of the visionary ideas and current developments on the strategic use of analytics for business
Covers the topic of data analytics in easy-to-understand terms without an undo emphasis on mathematics and the minutiae of statistical analysis
For organizations looking to enhance their capabilities via data analytics, this resource is the go-to reference for leveraging data to enhance business capabilities.
Frequently asked questions
How do I cancel my subscription?
Simply head over to the account section in settings and click on āCancel Subscriptionā - itās as simple as that. After you cancel, your membership will stay active for the remainder of the time youāve paid for. Learn more here.
Can/how do I download books?
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
What is the difference between the pricing plans?
Both plans give you full access to the library and all of Perlegoās features. The only differences are the price and subscription period: With the annual plan youāll save around 30% compared to 12 months on the monthly plan.
What is Perlego?
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, weāve got you covered! Learn more here.
Do you support text-to-speech?
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Is Analytics in a Big Data World an online PDF/ePUB?
Yes, you can access Analytics in a Big Data World by Bart Baesens in PDF and/or ePUB format, as well as other popular books in Business & Decision Making. We have over one million books available in our catalogue for you to explore.
Data are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data.1 In relative terms, this means 90 Āpercent of the data in the world has been created in the last two years. Gartner projects that by 2015, 85 percent of Fortune 500 organizations will be unable to exploit big data for competitive advantage and about 4.4 million jobs will be created around big data.2 Although these estimates should not be interpreted in an absolute sense, they are a strong indication of the ubiquity of big data and the strong need for analytical skills and resources because, as the data piles up, managing and analyzing these data resources in the most optimal way become critical success factors in creating competitive advantage and strategic Āleverage.
Figure 1.1 shows the results of a KDnuggets3 poll conducted during April 2013 about the largest data sets analyzed. The total number of respondents was 322 and the numbers per category are indicated between brackets. The median was estimated to be in the 40 to 50 gigabyte (GB) range, which was about double the median answer for a similar poll run in 2012 (20 to 40 GB). This clearly shows the quick increase in size of data that analysts are working on. A further regional breakdown of the poll showed that U.S. data miners lead other regions in big data, with about 28% of them working with terabyte (TB) size databases.
Figure 1.1 Results from a KDnuggets Poll about Largest Data Sets Analyzed
A main obstacle to fully harnessing the power of big data using analytics is the lack of skilled resources and ādata scientistā talent required to exploit big data. In another poll ran by KDnuggets in July 2013, a strong need emerged for analytics/big data/data mining/data science education.4 It is the purpose of this book to try and fill this gap by providing a concise and focused overview of analytics for the business practitioner.
EXAMPLE APPLICATIONS
Analytics is everywhere and strongly embedded into our daily lives. As I am writing this part, I was the subject of various analytical models today. When I checked my physical mailbox this morning, I found a catalogue sent to me most probably as a result of a response modeling analytical exercise that indicated that, given my characteristics and previous purchase behavior, I am likely to buy one or more products from it. Today, I was the subject of a behavioral scoring model of my financial institution. This is a model that will look at, among other things, my checking account balance from the past 12 months and my credit payments during that period, together with other kinds of information available to my bank, to predict whether I will default on my loan during the next year. My bank needs to know this for provisioning purposes. Also today, my telephone services provider analyzed my calling behavior and my account information to predict whether I will churn during the next three months. As I logged on to my Facebook page, the social ads appearing there were based on analyzing all information (posts, pictures, my friends and their behavior, etc.) available to Facebook. My Twitter posts will be analyzed (possibly in real time) by social media analytics to understand both the subject of my tweets and the sentiment of them. As I checked out in the supermarket, my loyalty card was scanned first, followed by all my purchases. This will be used by my supermarket to analyze my market basket, which will help it decide on product bundling, next best offer, improving shelf organization, and so forth. As I made the payment with my credit card, my credit card provider used a fraud detection model to see whether it was a legitimate transaction. When I receive my credit card statement later, it will be accompanied by various vouchers that are the result of an analytical customer segmentation exercise to better understand my expense behavior.
To summarize, the relevance, importance, and impact of analytics are now bigger than ever before and, given that more and more data are being collected and that there is strategic value in knowing what is hidden in data, analytics will continue to grow. Without claiming to be exhaustive, Table 1.1 presents some examples of how analytics is applied in various settings.
Table 1.1 Example Analytics Applications
Marketing
Risk Management
Government
Web
Logistics
Other
Response modeling
Credit risk modeling
Tax avoidance
Web analytics
Demand forecasting
Text analytics
Net lift modeling
Market risk modeling
Social security fraud
Social media analytics
Supply chain analytics
Business process analytics
Retention modeling
Operational risk modeling
Money laundering
Multivariate testing
Market basket analysis
Fraud detection
Terrorism detection
Recommender systems
Customer segmentation
It is the purpose of this book to discuss the underlying techniques and key challenges to work out the applications shown in Table 1.1 using analytics. Some of these applications will be discussed in further detail in Chapter 8.
BASIC NOMENCLATURE
In order to start doing analytics, some basic vocabulary needs to be defined. A first important concept here concerns the basic unit of analysis. Customers can be considered from various perspectives. Customer lifetime value (CLV) can be measured for either individual customers or at the household level. Another alternative is to look at account behavior. For example, consider a credit scoring exercise for which the aim is to predict whether the applicant will default on a particular mortgage loan account. The analysis can also be done at the transaction level. For example, in insurance fraud detection, one usually performs the analysis at insurance claim level. Also, in web analytics, the basic unit of analysis is usually a web visit or session.
It is also important to note that customers can play different roles. For example, parents can buy goods for their kids, such that there is a clear distinction between the payer and the end user. In a banking setting, a customer can be primary account owner, secondary account owner, main debtor of the credit, codebtor, guarantor, and so on. It is very important to clearly distinguish between those different roles when defining and/or aggregating data for the analytics exercise.
Finally, in case of predictive analytics, the target variable needs to be appropriately defined. For example, when is a customer considered to be a churner or not, a fraudster or not, a responder or not, or how should the CLV be appropriately defined?
ANALYTICS PROCESS MODEL
Figure 1.2 gives a high-level overview of the analytics process model.5 As a first step, a thorough definition of the business problem to be solved...
Table of contents
Citation styles for Analytics in a Big Data World
APA 6 Citation
Baesens, B. (2014). Analytics in a Big Data World (1st ed.). Wiley. Retrieved from https://www.perlego.com/book/995321/analytics-in-a-big-data-world-the-essential-guide-to-data-science-and-its-applications-pdf (Original work published 2014)
Chicago Citation
Baesens, Bart. (2014) 2014. Analytics in a Big Data World. 1st ed. Wiley. https://www.perlego.com/book/995321/analytics-in-a-big-data-world-the-essential-guide-to-data-science-and-its-applications-pdf.
Harvard Citation
Baesens, B. (2014) Analytics in a Big Data World. 1st edn. Wiley. Available at: https://www.perlego.com/book/995321/analytics-in-a-big-data-world-the-essential-guide-to-data-science-and-its-applications-pdf (Accessed: 14 October 2022).
MLA 7 Citation
Baesens, Bart. Analytics in a Big Data World. 1st ed. Wiley, 2014. Web. 14 Oct. 2022.