eBook - ePub

Elasticsearch Server - Third Edition

Name: Elasticsearch Server - Third Edition
ISBN: 9781785888816

Rafal Kuc,

Marek Rogozinski,

556 pages
English
ePUB (mobile friendly)
Available on iOS & Android

eBook - ePub

Elasticsearch Server - Third Edition

Rafal Kuc,

Marek Rogozinski,

About this book

Leverage Elasticsearch to create a robust, fast, and flexible search solution with ease

About This Book

Boost the searching capabilities of your system through synonyms, multilingual data handling, nested objects and parent-child documents
Deep dive into the world of data aggregation and data analysis with ElasticSearch
Explore a wide range of ElasticSearch modules that define the behavior of a cluster

Who This Book Is For

If you are a competent developer and want to learn about the great and exciting world of ElasticSearch, then this book is for you. No prior knowledge of Java or Apache Lucene is needed.

What You Will Learn

Configure, create, and retrieve data from your indices
Use an ElasticSearch query DSL to create a wide range of queries
Discover the highlighting and geographical search features offered by ElasticSearch
Find out how to index data that is not flat or data that has a relationship
Exploit a prospective search to search for queries not documents
Use the aggregations framework to get more from your data and improve your client's search experience
Monitor your cluster state and health using the ElasticSearch API as well as third-party monitoring solutions
Discover how to properly set up ElasticSearch for various use cases

In Detail

ElasticSearch is a very fast and scalable open source search engine, designed with distribution and cloud in mind, complete with all the goodies that Apache Lucene has to offer. ElasticSearch's schema-free architecture allows developers to index and search unstructured content, making it perfectly suited for both small projects and large big data warehouses, even those with petabytes of unstructured data.

This book will guide you through the world of the most commonly used ElasticSearch server functionalities. You'll start off by getting an understanding of the basics of ElasticSearch and its data indexing functionality. Next, you will see the querying capabilities of ElasticSearch, followed by a through explanation of scoring and search relevance. After this, you will explore the aggregation and data analysis capabilities of ElasticSearch and will learn how cluster administration and scaling can be used to boost your application performance. You'll find out how to use the friendly REST APIs and how to tune ElasticSearch to make the most of it. By the end of this book, you will have be able to create amazing search solutions as per your project's specifications.

Style and approach

This step-by-step guide is full of screenshots and real-world examples to take you on a journey through the wonderful world of full text search provided by ElasticSearch.

Tools to learn more effectively

Saving Books

Keyword Search

Annotating Text

Listen to it instead

Information

Publisher

Packt Publishing

Year

2016

eBook ISBN

9781785888816

Edition

Topic

Computer Science

Subtopic

Web Development

Index

Computer Science

Elasticsearch Server Third Edition

Credits

About the Authors

About the Reviewer

www.PacktPub.com

eBooks, discount offers, and more

Why subscribe?

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Downloading the color images of this book

Errata

Piracy

Questions

1. Getting Started with Elasticsearch Cluster

Full text searching

The Lucene glossary and architecture

Input data analysis

Indexing and querying

Scoring and query relevance

The basics of Elasticsearch

Key concepts of Elasticsearch

Index

Document

Document type

Mapping

Key concepts of the Elasticsearch infrastructure

Nodes and clusters

Shards

Replicas

Gateway

Indexing and searching

Installing and configuring your cluster

Installing Java

Installing Elasticsearch

Running Elasticsearch

Shutting down Elasticsearch

The directory layout

Configuring Elasticsearch

The system-specific installation and configuration

Installing Elasticsearch on Linux

Installing Elasticsearch using RPM packages

Installing Elasticsearch using the DEB package

Elasticsearch configuration file localization

Configuring Elasticsearch as a system service on Linux

Elasticsearch as a system service on Windows

Manipulating data with the REST API

Understanding the REST API

Storing data in Elasticsearch

Creating a new document

Automatic identifier creation

Retrieving documents

Updating documents

Dealing with non-existing documents

Adding partial documents

Deleting documents

Versioning

Usage example

Versioning from external systems

Searching with the URI request query

Sample data

URI search

Elasticsearch query response

Query analysis

URI query string parameters

The query

The default search field

Analyzer

The default operator property

Query explanation

The fields returned

Sorting the results

The search timeout

The results window

Limiting per-shard results

Ignoring unavailable indices

The search type

Lowercasing term expansion

Wildcard and prefix analysis

Lucene query syntax

Summary

2. Indexing Your Data

Elasticsearch indexing

Shards and replicas

Write consistency

Creating indices

Altering automatic index creation

Settings for a newly created index

Index deletion

Mappings configuration

Type determining mechanism

Disabling the type determining mechanism

Tuning the type determining mechanism for numeric types

Tuning the type determining mechanism for dates

Index structure mapping

Type and types definition

Fields

Core types

Common attributes

String

Number

Boolean

Binary

Date

Multi fields

The IP address type

Token count type

Using analyzers

Out-of-the-box analyzers

Defining your own analyzers

Default analyzers

Different similarity models

Setting per-field similarity

Available similarity models

Configuring default similarity

Configuring BM25 similarity

Configuring DFR similarity

Configuring IB similarity

Batch indexing to speed up your indexing process

Preparing data for bulk indexing

Indexing the data

The _all field

The _source field

Additional internal fields

Introduction to segment merging

Segment merging

The need for segment merging

The merge policy

The merge scheduler

Throttling

Introduction to routing

Default indexing

Default searching

Routing

The routing parameters

Routing fields

Summary

3. Searching Your Data

Querying Elasticsearch

The example data

A simple query

Paging and result size

Returning the version value

Limiting the score

Choosing the fields that we want to return

Source filtering

Using the script fields

Passing parameters to the script fields

Understanding the querying process

Query logic

Search type

Search execution preference

Search shards API

Basic queries

The term query

The terms query

The match all query

The type query

The exists query

The missing query

The common terms query

The match query

The Boolean match query

The phrase match query

The match phrase prefix query

The multi match query

The query string query

Running the query string query against multiple fields

The simple query string query

The identifiers query

The prefix query

The fuzzy query

The wildcard query

The range query

Regular expression query

The more like this query

Compound queries

The bool query

The dis_max query

The boosting query

The constant_score query

The indices query

Using span queries

A span

Span term query

Span first query

Span near query

Span or query

Span not query

Span within query

Span containing query

Span multi query

Performance considerations

Choosing the right query

The use cases

Limiting results to given tags

Searching for values in a range

Boosting some of the matched documents

Ignoring lower scoring partial queries

Using Lucene query syntax in queries

Handling user queries without errors

Autocomplete using prefixes

Finding terms similar to a given one

Matching phrases

Spans, spans everywhere

Summary

4. Extending Your Querying Knowledge

Filtering your results

The context is the key

Explicit filtering with bool query

Highlighting

Getting started with highlighting

Field configuration

Under the hood

Forcing highlighter type

Configuring HTML tags

Controlling highlighted fragments

Global and local settings

Require matching

Custom highlighting query

The Postings highlighter

Validating your queries

Using the Validate API

Sorting data

Default sorting

Selecting fields used for sorting

Sorting mode

Specifying behavior for missing fields

Dynamic criteria

Calculate scoring when sorting

Query rewrite

Prefix query as an example

Getting back to Apache Lucene

Query rewrite properties

Summary

5. Extending Your Index Structure

Indexing tree-like structures

Data structure

Analysis

Indexing data that is not flat

Data

Objects

Arrays

Mappings

Final mappings

Sending the mappings to Elasticsearch

To be or not to be dynamic

Disabling object indexing

Using nested object...

Elasticsearch Server Third Edition

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription

No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn how to download books offline

Perlego offers two plans: Essential and Complete

Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.

Both plans are available with monthly, semester, or annual billing cycles.

We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 990+ topics, we’ve got you covered! Learn about our mission

Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more about Read Aloud

Yes! You can use the Perlego app on both iOS and Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app

Yes, you can access Elasticsearch Server - Third Edition by Rafal Kuc, Marek Rogozinski in PDF and/or ePUB format, as well as other popular books in Computer Science & Web Development. We have over one million books available in our catalogue for you to explore.

Elasticsearch Server - Third Edition

Elasticsearch Server - Third Edition

About this book

Tools to learn more effectively

Information

Elasticsearch Server Third Edition

Table of Contents

Table of contents

Frequently asked questions