Clojure Data Analysis Cookbook - Second Edition
eBook - ePub

Eric Rochester

  1. 372 pages
  2. English

Information

Year
2015
ISBN
9781784390297


Table of Contents

Clojure Data Analysis Cookbook Second Edition
Credits
About the Author
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers, and more
Why subscribe?
Free access for Packt account holders
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example code
Downloading the color images of this book
Errata
Piracy
Questions
1. Importing Data for Analysis
Introduction
Creating a new project
Getting ready
How to do it...
How it works...
Reading CSV data into Incanter datasets
Getting ready
How to do it…
How it works…
There's more…
Reading JSON data into Incanter datasets
Getting ready
How to do it…
How it works…
Reading data from Excel with Incanter
Getting ready
How to do it…
How it works…
Reading data from JDBC databases
Getting ready
How to do it…
How it works…
See also
Reading XML data into Incanter datasets
Getting ready
How to do it…
How it works…
There's more…
Navigating structures with zippers
Processing in a pipeline
Comparing XML and JSON
Scraping data from tables in web pages
Getting ready
How to do it…
How it works…
See also
Scraping textual data from web pages
Getting ready
How to do it…
How it works…
Reading RDF data
Getting ready
How to do it…
How it works…
See also
Querying RDF data with SPARQL
Getting ready
How to do it…
How it works…
There's more…
Aggregating data from different formats
Getting ready
How to do it…
Creating the triple store
Scraping exchange rates
Loading currency data and tying it all together
How it works…
See also
2. Cleaning and Validating Data
Introduction
Cleaning data with regular expressions
Getting ready
How to do it…
How it works…
There's more...
See also
Maintaining consistency with synonym maps
Getting ready
How to do it…
How it works…
See also
Identifying and removing duplicate data
Getting ready
How to do it…
How it works…
There's more…
Regularizing numbers
Getting ready
How to do it…
How it works…
Calculating relative values
Getting ready
How to do it…
How it works…
Parsing dates and times
Getting ready
How to do it…
There's more…
Lazily processing very large data sets
Getting ready
How to do it…
How it works…
Sampling from very large data sets
Getting ready
How to do it…
Sampling by percentage
Sampling exactly
How it works…
Fixing spelling errors
Getting ready
How to do it…
How it works…
There's more…
Parsing custom data formats
Getting ready
How to do it…
How it works…
Validating data with Valip
Getting ready
How to do it…
How it works…
3. Managing Complexity with Concurrent Programming
Introduction
Managing program complexity with STM
Getting ready
How to do it…
How it works…
See also
Managing program complexity with agents
Getting ready
How to do it…
How it works…
See also
Getting better performance with commute
Getting ready
How to do it…
How it works…
Combining agents and STM
Getting ready
How to do it…
How it works…
Maintaining consistency with ensure
Getting ready
How to do it…
How it works…
Introducing safe side effects into the STM
Getting ready
How to do it…
Maintaining data consistency with validators
Getting ready
How to do it…
How it works…
See also
Monitoring processing with watchers
Getting ready
How to do it…
How it works…
Debugging concurrent programs with watchers
Getting ready
How to do it…
There's more...
Recovering from errors in agents
How to do it…
Failing on errors
Continuing on errors
Using a custom error handler
There's more...
Managing large inputs with sized queues
How to do it…
How it works...
4. Improving Performance with Parallel Programming
Introduction
Parallelizing processing with pmap
How to do it…
How it works…
There's more…
See also
Parallelizing processing with Incanter
Getting ready
How to do it…
How it works…
Partitioning Monte Carlo simulations for better pmap performance
Getting ready
How to do it…
How it works…
Estimating with Monte Carlo simulations
Chunking data for pmap
Finding the optimal partition size with simulated annealing
Getting ready
How to do it…
How it works…
There's more…
Combining function calls with reducers
Getting ready
How to do it…
What happened here?
There's more...
See also
Parallelizing with reducers
Getting ready
How to do it…
How it works…
See also
Generating online summary statistics for data streams with reducers
Getting ready
How to do it…
Using type hints
Getting ready
How to do it…
How it works…
See also
Benchmarking with Criterium
Getting ready
How to do it…
How it works…
See also
5. Distributed Data Processing with Cascalog
Introduction
Initializing Cascalog and Hadoop for distributed processing
Getting ready
How to do it…
How it works…
See also
Querying data with Cascalog
Getting ready
How to do it…
How it works…
There's more
Distributing data with Apache HDFS
Getting ready
How to do it…
How it works…
Parsing CSV files with Cascalog
Getting ready
How to do it…
How it works…
There's more
Executing complex queries with Cascalog
Getting ready
How to do it…
Aggregating data with Cascalog
Getting ready
How to do it…
There's more
Defining new Cascalog operators
Getting ready
How to do it…
Creating map operators
Creating map concatenation operators
Creating filter operators
C...
