Julia for Data Science
eBook - ePub

Julia for Data Science

Anshul Joshi

Partager le livre
  1. 346 pages
  2. English
  3. ePUB (adapté aux mobiles)
  4. Disponible sur iOS et Android
eBook - ePub

Julia for Data Science

Anshul Joshi

DĂ©tails du livre
Aperçu du livre
Table des matiĂšres
Citations

À propos de ce livre

Explore the world of data science from scratch with Julia by your side

About This Book

  • An in-depth exploration of Julia's growing ecosystem of packages
  • Work with the most powerful open-source libraries for deep learning, data wrangling, and data visualization
  • Learn about deep learning using Mocha.jl and give speed and high performance to data analysis on large data sets

Who This Book Is For

This book is aimed at data analysts and aspiring data scientists who have a basic knowledge of Julia or are completely new to it. The book also appeals to those competent in R and Python and wish to adopt Julia to improve their skills set in Data Science. It would be beneficial if the readers have a good background in statistics and computational mathematics.

What You Will Learn

  • Apply statistical models in Julia for data-driven decisions
  • Understanding the process of data munging and data preparation using Julia
  • Explore techniques to visualize data using Julia and D3 based packages
  • Using Julia to create self-learning systems using cutting edge machine learning algorithms
  • Create supervised and unsupervised machine learning systems using Julia. Also, explore ensemble models
  • Build a recommendation engine in Julia
  • Dive into Julia's deep learning framework and build a system using Mocha.jl

In Detail

Julia is a fast and high performing language that's perfectly suited to data science with a mature package ecosystem and is now feature complete. It is a good tool for a data science practitioner. There was a famous post at Harvard Business Review that Data Scientist is the sexiest job of the 21st century. (https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century).

This book will help you get familiarised with Julia's rich ecosystem, which is continuously evolving, allowing you to stay on top of your game.

This book contains the essentials of data science and gives a high-level overview of advanced statistics and techniques. You will dive in and will work on generating insights by performing inferential statistics, and will reveal hidden patterns and trends using data mining. This has the practical coverage of statistics and machine learning. You will develop knowledge to build statistical models and machine learning systems in Julia with attractive visualizations.

You will then delve into the world of Deep learning in Julia and will understand the framework, Mocha.jl with which you can create artificial neural networks and implement deep learning.

This book addresses the challenges of real-world data science problems, including data cleaning, data preparation, inferential statistics, statistical modeling, building high-performance machine learning systems and creating effective visualizations using Julia.

Style and approach

This practical and easy-to-follow yet comprehensive guide will get you learning about Julia with respect to data science. Each topic is explained thoroughly and placed in context. For the more inquisitive, we dive deeper into the language and its use case. This is the one true guide to working with Julia in data science.

Foire aux questions

Comment puis-je résilier mon abonnement ?
Il vous suffit de vous rendre dans la section compte dans paramĂštres et de cliquer sur « RĂ©silier l’abonnement ». C’est aussi simple que cela ! Une fois que vous aurez rĂ©siliĂ© votre abonnement, il restera actif pour le reste de la pĂ©riode pour laquelle vous avez payĂ©. DĂ©couvrez-en plus ici.
Puis-je / comment puis-je télécharger des livres ?
Pour le moment, tous nos livres en format ePub adaptĂ©s aux mobiles peuvent ĂȘtre tĂ©lĂ©chargĂ©s via l’application. La plupart de nos PDF sont Ă©galement disponibles en tĂ©lĂ©chargement et les autres seront tĂ©lĂ©chargeables trĂšs prochainement. DĂ©couvrez-en plus ici.
Quelle est la différence entre les formules tarifaires ?
Les deux abonnements vous donnent un accĂšs complet Ă  la bibliothĂšque et Ă  toutes les fonctionnalitĂ©s de Perlego. Les seules diffĂ©rences sont les tarifs ainsi que la pĂ©riode d’abonnement : avec l’abonnement annuel, vous Ă©conomiserez environ 30 % par rapport Ă  12 mois d’abonnement mensuel.
Qu’est-ce que Perlego ?
Nous sommes un service d’abonnement Ă  des ouvrages universitaires en ligne, oĂč vous pouvez accĂ©der Ă  toute une bibliothĂšque pour un prix infĂ©rieur Ă  celui d’un seul livre par mois. Avec plus d’un million de livres sur plus de 1 000 sujets, nous avons ce qu’il vous faut ! DĂ©couvrez-en plus ici.
Prenez-vous en charge la synthÚse vocale ?
Recherchez le symbole Écouter sur votre prochain livre pour voir si vous pouvez l’écouter. L’outil Écouter lit le texte Ă  haute voix pour vous, en surlignant le passage qui est en cours de lecture. Vous pouvez le mettre sur pause, l’accĂ©lĂ©rer ou le ralentir. DĂ©couvrez-en plus ici.
Est-ce que Julia for Data Science est un PDF/ePUB en ligne ?
Oui, vous pouvez accĂ©der Ă  Julia for Data Science par Anshul Joshi en format PDF et/ou ePUB ainsi qu’à d’autres livres populaires dans Computer Science et Programming Algorithms. Nous disposons de plus d’un million d’ouvrages Ă  dĂ©couvrir dans notre catalogue.

Informations

Année
2016
ISBN
9781785289699

Julia for Data Science


Julia for Data Science

Copyright © 2016 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
First published: September 2016
Production reference: 1260916
Published by Packt Publishing Ltd.
Livery Place
35 Livery Street
Birmingham
B3 2PB, UK.
ISBN 978-1-78528-969-9
www.packtpub.com

Credits

Author
Anshul Joshi
Copy Editor
Safis Editing
Reviewer
SĂ©bastien Celles
Project Coordinator
Nidhi Joshi
Commissioning Editor
Akram Hussain
Proofreader
Safis Editing
Acquisition Editor
Sonali Vernekar
Indexer
Mariammal Chettiyar
Content Development Editor
Aishwarya Pandere
Graphics
Disha Haria
Technical Editor
Vivek Arora
Production Coordinator
Arvindkumar Gupta

About the Author

Anshul Joshi is a data science professional with more than 2 years of experience primarily in data munging, recommendation systems, predictive modeling, and distributed computing. He is a deep learning and AI enthusiast. Most of the time, he can be caught exploring GitHub or trying anything new on which he can get his hands on. He blogs on anshuljoshi.xyz.
I'd like to thank my parents, who have been really supportive throughout, my professors, who helped me during my days at university and got me where I am, and my friends, who were very understanding. A big thanks to the Julia community. These people are amazing and are the rockstars of our generation.
I would also like to thank Packt Publishing and the editors for helping me throughout. A special thanks to SĂ©bastien Celles; his expertise and reviews really helped me improve the book.

About the Reviewer

SĂ©bastien Celles is a professor of applied physics at Poitiers Institute of Technology (UniversitĂ© de Poitiers—IUT de Poitiers—thermal science department). He teaches physics and computer sciences (data processing).
He has used Python for numerical simulations, data plotting, data predicting, and various other tasks since the early 2000s. He is a member of PyData and was granted commit rights to the pandas DataReader project. He is also involved in several open source projects about the scientific Python ecosystem.
He is also author of some Python packages available on PyPi:
  • openweathermap_requests: A package to fetch data from http://openweathermap.org/ using requests and requests-cache and get pandas DataFrames with weather history
  • pandas_degreedays: A package to calculate degree days (a measure of heating or cooling) from a pandas time series of temperature
  • pandas_confusion: A package to manage confusion matrices, plot them, binarize them, calculate overall statistics, and class statistics
He made some contributions (unit testing, continuous integration, Python 3 port
) too:
  • python-constraint: A Constraint Solving Problem (CSP) resolver for Python
He was a technical reviewer of Mastering Python for Data Science explores the world of data science through Python and learn how to make sense of data. Samir Madhavan. Birmingham, UK, Packt Publishing, August 2015.
Two years ago, he started to learn Julia, with which he has performed various tasks about data mining, machine learning, forecasting, and so he's a user of (and sometimes a contributor too) some Julia packages (DataStructures.jl, CSV.jl, DataFrames.jl, TimeSeries.jl, NDSparseData.jl, JuliaTS.jl, MLBase.jl, Mocha.jl, and so on)
He is also author of some Julia packages:
  • Pushover.jl: A package to send notifications using the Pushover Notification Service
  • BulkSMS.jl: A Julia package to send SMS (Short Message Service) using BulkSMS API
  • DataReaders.jl: A package to get remote data via Requests.jl and get DataFrames thanks to DataFrames.jl
  • RequestsCache.jl: A transparent persistent cache using the Requests.jl library to perform requests and using JLD.jl library as a storage backend
  • PubSub.jl: A very basic implementation of the publish-subscribe pattern
  • SignalSlot.jl: A very basic implementation of the signal-slot pattern
  • TALib.jl: A Julia wrapper for TA-Lib (Technical Analysis Library)
He has a keen interest in open data and he is a contributor of some projects of the Open Knowledge Foundation (especially around the DataPackage format).
You can find more information about him at http://www.celles.net/wiki/Contact.

www.PacktPub.com

For support files and downloads related to your book, please visit www.PacktPub.com.
Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at [email protected] for more details.
At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.
www.PacktPub.com
https://www.packtpub.com/mapt
Get the most in-demand software skills with Mapt. Mapt gives you full access to all Packt books and video courses, as well as industry-leading tools to help you pl...

Table des matiĂšres