Advanced Data Science and Analytics with Python
eBook - ePub

Advanced Data Science and Analytics with Python

Jesus Rogel-Salazar

Compartir libro
  1. 384 páginas
  2. English
  3. ePUB (apto para móviles)
  4. Disponible en iOS y Android
eBook - ePub

Advanced Data Science and Analytics with Python

Jesus Rogel-Salazar

Detalles del libro
Vista previa del libro
Índice
Citas

Información del libro

Advanced Data Science and Analytics with Python enables data scientists to continue developing their skills and apply them in business as well as academic settings. The subjects discussed in this book are complementary and a follow-up to the topics discussed in Data Science and Analytics with Python. The aim is to cover important advanced areas in data science using tools developed in Python such as SciKit-learn, Pandas, Numpy, Beautiful Soup, NLTK, NetworkX and others. The model development is supported by the use of frameworks such as Keras, TensorFlow and Core ML, as well as Swift for the development of iOS and MacOS applications.

Features:

  • Targets readers with a background in programming, who are interested in the tools used in data analytics and data science
  • Uses Python throughout
  • Presents tools, alongside solved examples, with steps that the reader can easily reproduce and adapt to their needs
  • Focuses on the practical use of the tools rather than on lengthy explanations
  • Provides the reader with the opportunity to use the book whenever needed rather than following a sequential path

The book can be read independently from the previous volume and each of the chapters in this volume is sufficiently independent from the others, providing flexibility for the reader. Each of the topics addressed in the book tackles the data science workflow from a practical perspective, concentrating on the process and results obtained. The implementation and deployment of trained models are central to the book.

Time series analysis, natural language processing, topic modelling, social network analysis, neural networks and deep learning are comprehensively covered. The book discusses the need to develop data products and addresses the subject of bringing models to their intended audiences – in this case, literally to the users' fingertips in the form of an iPhone app.

About the Author

Dr. Jesús Rogel-Salazar is a lead data scientist in the field, working for companies such as Tympa Health Technologies, Barclays, AKQA, IBM Data Science Studio and Dow Jones. He is a visiting researcher at the Department of Physics at Imperial College London, UK and a member of the School of Physics, Astronomy and Mathematics at the University of Hertfordshire, UK.

Preguntas frecuentes

¿Cómo cancelo mi suscripción?
Simplemente, dirígete a la sección ajustes de la cuenta y haz clic en «Cancelar suscripción». Así de sencillo. Después de cancelar tu suscripción, esta permanecerá activa el tiempo restante que hayas pagado. Obtén más información aquí.
¿Cómo descargo los libros?
Por el momento, todos nuestros libros ePub adaptables a dispositivos móviles se pueden descargar a través de la aplicación. La mayor parte de nuestros PDF también se puede descargar y ya estamos trabajando para que el resto también sea descargable. Obtén más información aquí.
¿En qué se diferencian los planes de precios?
Ambos planes te permiten acceder por completo a la biblioteca y a todas las funciones de Perlego. Las únicas diferencias son el precio y el período de suscripción: con el plan anual ahorrarás en torno a un 30 % en comparación con 12 meses de un plan mensual.
¿Qué es Perlego?
Somos un servicio de suscripción de libros de texto en línea que te permite acceder a toda una biblioteca en línea por menos de lo que cuesta un libro al mes. Con más de un millón de libros sobre más de 1000 categorías, ¡tenemos todo lo que necesitas! Obtén más información aquí.
¿Perlego ofrece la función de texto a voz?
Busca el símbolo de lectura en voz alta en tu próximo libro para ver si puedes escucharlo. La herramienta de lectura en voz alta lee el texto en voz alta por ti, resaltando el texto a medida que se lee. Puedes pausarla, acelerarla y ralentizarla. Obtén más información aquí.
¿Es Advanced Data Science and Analytics with Python un PDF/ePUB en línea?
Sí, puedes acceder a Advanced Data Science and Analytics with Python de Jesus Rogel-Salazar en formato PDF o ePUB, así como a otros libros populares de Economics y Statistics for Business & Economics. Tenemos más de un millón de libros disponibles en nuestro catálogo para que explores.

Información

Año
2020
ISBN
9780429822315

1

No Time to Lose: Time Series Analysis

HAVE YOU EVER WONDERED WHAT the weather, financial prices, home energy usage, and your weight all have in common? Well, appart from the obvious, the data to analyse these phenomena can be collected at regular intervals over time. Common sense, right? Well, there is no time to lose; let us take a deeper look into this exciting kind of data. Are you ready?
Not obvious? Oh… well, read on!
Or is it Toulouse, like “Toulouse” in France?
A time series is defined as a sequence of data reading in successive order and can be taken on any variable that changes over time. So, if a time series is a set of data collected over time, then a lot of things, not just our weight or the weather, would be classed as time series, and perhaps that is true. There are, obviously and quite literally, millions of data points that can be collected over time. However, time series analysis is not necessarily immediately employed.
A lot of data is collected over time, but that does not make the data set a time series.
Time series analysis encapsulates the methods used to understand the sequence of data points mentioned above and extract useful information from it. A main goal is that of forecasting successive future values of the series. In this chapter we will cover some of these methods. Let us take a look.

1.1 Time Series

KNOWING HOW TO MODEL TIME series is surely an important tool in our Jackalope data scientist toolbox. Jackalopes? Yes! Long story… You can get further information in Chapter 1 of Data Science and Analytics with Python.1. But I digress, the key point about time series data is that the ordering of the data points in time matters. For many datasets it is not important in which order the data are obtained or listed. One order is as good as another, and although the ordering may tell us something about the dataset, it is not an inherent attribute of the set.
See for instance the datasets analysed in the book mentioned above.
However, for time series data the ordering is absolutely crucial. The order imposes a certain structure on the data, which in turn is of relevance to the underlying phenomenon studied. So, what is different about time series? Well, Time! Furthermore, we will see later on in this chapter that in some cases there are situations where future observations are influenced by past data points. All in all, this is not a surprising statement; we are well acquainted with causality relationships.
What is different about time series? —Time!
Let us have a look at an example of a time series. In Figure 1.1 we can see a financial time series corresponding to the log returns of Apple for a year starting in April 2017. The log returns are used to determine the proportional amount you might get on a given day compared to the previous one. With that description in mind, we can see how we are relating the value on day n to the one on day n − 1.
The log return is given by log(FVPV), where FV is the future value and PV is the past value.
Image
Figure 1.1: A time series of the log returns for Apple Inc. for a year since April 2017.
In that way, a...

Índice