SQL Server 2017 Machine Learning Services with R
eBook - ePub

SQL Server 2017 Machine Learning Services with R

  1. English
  2. ePUB (mobile friendly)
  3. Available on iOS & Android
eBook - ePub

SQL Server 2017 Machine Learning Services with R

About this book

Develop and run efficient R scripts and predictive models for SQL Server 2017About This Book• Learn how you can combine the power of R and SQL Server 2017 to build efficient, cost-effective data science solutions• Leverage the capabilities of R Services to perform advanced analytics—from data exploration to predictive modeling• A quick primer with practical examples to help you get up- and- running with SQL Server 2017 Machine Learning Services with R, as part of database solutions with continuous integration / continuous delivery.Who This Book Is ForThis book is for data analysts, data scientists, and database administrators with some or no experience in R but who are eager to easily deliver practical data science solutions in their day-to-day work (or future projects) using SQL Server.What You Will Learn• Get an overview of SQL Server 2017 Machine Learning Services with R• Manage SQL Server Machine Learning Services from installation to configuration and maintenance• Handle and operationalize R code • Explore RevoScaleR R algorithms and create predictive models• Deploy, manage, and monitor database solutions with R• Extend R with SQL Server 2017 features• Explore the power of R for database administratorsIn DetailR Services was one of the most anticipated features in SQL Server 2016, improved significantly and rebranded as SQL Server 2017 Machine Learning Services. Prior to SQL Server 2016, many developers and data scientists were already using R to connect to SQL Server in siloed environments that left a lot to be desired, in order to do additional data analysis, superseding SSAS Data Mining or additional CLR programming functions. With R integrated within SQL Server 2017, these developers and data scientists can now benefit from its integrated, effective, efficient, and more streamlined analytics environment. This book gives you foundational knowledge and insights to help you understand SQL Server 2017 Machine Learning Services with R. First and foremost, the book provides practical examples on how to implement, use, and understand SQL Server and R integration in corporate environments, and also provides explanations and underlying motivations. It covers installing Machine Learning Services;maintaining, deploying, and managing code;and monitoring your services. Delving more deeply into predictive modeling and the RevoScaleR package, this book also provides insights into operationalizing code and exploring and visualizing data. To complete the journey, this book covers the new features in SQL Server 2017 and how they are compatible with R, amplifying their combined power.Style and approachThis fast-paced guide will help data scientists and DBAs implement all new data science projects using SQL Server 2017 Machine Learning Services.

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription.
No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn more here.
Perlego offers two plans: Essential and Complete
  • Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
  • Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Both plans are available with monthly, semester, or annual billing cycles.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes! You can use the Perlego app on both iOS or Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Yes, you can access SQL Server 2017 Machine Learning Services with R by Tomaz Kastrun, Julie Koesmarno in PDF and/or ePUB format, as well as other popular books in Computer Science & Data Modelling & Design. We have over one million books available in our catalogue for you to explore.

Deploying, Managing, and Monitoring Database Solutions containing R Code

Operationalizing R code in a SQL Server database means that data scientists/database developers can also leverage productionizing data science solutions as part of Database Lifecycle Management (DLM). This includes the following:
  • Checking in R code as part of a SQL Server database project into a version control
  • Adding the stored procedures for the data science solution as part of SQL Server unit tests
  • Integrating the data science solution into the Continuous Integration/Continuous Delivery (CI/CD) process
  • Monitoring performance of the data science solution in the production on a regular basis
In this chapter, we will be using SQL Server Data Tools (SSDT) in Visual Studio 2017 and Visual Studio Team Services to perform this DLM workflow. However, the underlying concept can be applied to any other CI/CD platform that you or your team might already be using.

Integrating R into the SQL Server Database lifecycle workflow

Earlier in Chapter 7, Operationalizing R Prediction Models, we discussed how to create an R project in Visual Studio 2017. We also talked about integrating R code as part of sp_execute_external_script in SQL Server. Here, we will revisit Visual Studio 2017, specifically in the context of integrating R code in sp_execute_external_script as part of a SQL Server Database Project, and holistically as part of the database lifecycle workflow.

Preparing your environment for the database lifecycle workflow

In this section, we will discuss the stages of the database lifecycle workflow and the tools that we will be using. For each of the stages in the workflow, there will also be some suggested alternatives for you to explore.
  1. Coding and managing SQL Server database projects/solutions: There are a few different ways to manage your SQL Server DML/DDL scripts that form a SQL Server database project. SQL SSDT in Visual Studio 2017 (VS2017) is a mature product that formalizes the creation and modification of Database Schema and Objects. In this section, we will use SSDT in VS2017.
You can use VS2017 Community, Professional, or Enterprise editions. Please check https://www.visualstudio.com/vs/compare/ for more up-to-date information on how these editions compare. In the walkthroughs and examples in this section, we will be using Visual Studio Enterprise Edition, but you can use any of the editions. You can download these from: https://www.visualstudio.com/vs/.
Other following alternatives worth trying are:
    • SQL Server Management Studio: There are a few plugins developed by RedGate that can enrich DevOps/Database Lifecycle Management
    • SQL Operations Studio (in Preview): This tool is built based on VS Code, which means that it has high potential of meeting DevOps workflows too, including source control
  1. Unit testing: Just like application development, database development would benefit from a unit testing framework, especially if it can be automated. There are two well-known unit testing frameworks that are available for SQL Server databases, tSQLt, and SQL Server Unit Test integrated in Visual Studio. Here are the links:
    • tSQLt: http://tsqlt.org/
    • SQL Server Unit Test in Visual Studio: https://msdn.microsoft.com/en-us/library/jj851200(v=vs.103).aspx
In this section, we will use SQL Server Unit Test in VS2017.
Another tool worth trying is:
    • RedGate SQL Test that is based on the tSQLt framework, an extension of SSMS
  1. Version Control: There are a number of popular choices for Version Control systems out there, for example, Git and Team Foundation Version Control (TFVC). In this section, we will use TFVC hosted in Visual Studio Team Services (VSTS). VS2017 can connect to a VSTS repository. You can sign up for a VSTS account online at: https://www.visualstudio.com/team-services/.
Other alternatives worth trying are:
Using Visual Studio, you can connect to online version control hosts, such as GitHub and...

Table of contents

  1. Title Page
  2. Copyright and Credits
  3. www.PacktPub.com
  4. Contributors
  5. Preface
  6. Introduction to R and SQL Server
  7. Overview of Microsoft Machine Learning Server and SQL Server
  8. Managing Machine Learning Services for SQL Server 2017 and R
  9. Data Exploration and Data Visualization
  10. RevoScaleR Package
  11. Predictive Modeling
  12. Operationalizing R Code
  13. Deploying, Managing, and Monitoring Database Solutions containing R Code
  14. Machine Learning Services with R for DBAs
  15. R and SQL Server 2016/2017 Features Extended
  16. Other Books You May Enjoy