Elastic Elasticsearch Reader/Writer

FME provides read and write access to Elasticsearch clusters (version 7+).

FME treats each document as a feature and each field in a document as an attribute.

Elasticsearch is an open source full-text search engine. Elasticsearch indices are JSON document stores that support LonLat or GeoJSON geometry.

More information about Elasticsearch can be found at www.elastic.co.

Note The reader/writer for Elasticsearch v6.8 and earlier (ELASTICSEARCH) was deprecated in an earlier FME version.

Product and System Requirements

Format	FME Platform			Operating System
Reader/Writer	FME Form	FME Flow	FME Flow Hosted	Windows 64-bit	Linux	Mac
Reader	Yes	Yes	Yes	Yes	Yes	Yes
Writer	Yes	Yes	Yes	Yes	Yes	Yes

Elasticsearch Datasets

Note Elasticsearch Types were removed in version 7.

FME Terminology	ELASTICSEARCH_CLUSTER (v7+) Terminology
Dataset	The dataset for Elasticsearch version 7 and later is a Cluster.
Feature Type	The feature type for Elasticsearch version 7 and later is an Index. Each Elasticsearch Cluster can contain multiple Indices.

Attribute Types and Attribute Index Types

The following Attribute Types have changed between Elasticsearch versions:

string
text
keyword

The following Attribute Index Types have changed between Elasticsearch versions:

Analyzed
NotAnalyzed

The string field type comes from Elasticsearch v2 and earlier. A string attribute with the Analyzed index type is exactly equivalent to text, and a string attribute with the NotAnalyzed index type is exactly equivalent to keyword.

FME accepts text and keyword for FME Attribute Types.

The NotIndexed Attribute Index Type is supported for all Attribute Types.
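
The equivalence between the legacy and current type names can be sketched as a small lookup. This is an illustrative Python sketch, not FME code: the helper name normalize_field_type and the tuple-keyed table are assumptions made for the example.

```python
# Hypothetical lookup: (legacy attribute type, index type) -> current type.
LEGACY_STRING_TYPES = {
    ("string", "Analyzed"): "text",
    ("string", "NotAnalyzed"): "keyword",
}


def normalize_field_type(attr_type, index_type=None):
    """Return the current type name for an attribute type.

    Anything that is not a legacy string/index-type pair listed in the
    table is returned unchanged.
    """
    return LEGACY_STRING_TYPES.get((attr_type, index_type), attr_type)


print(normalize_field_type("string", "Analyzed"))     # text
print(normalize_field_type("string", "NotAnalyzed"))  # keyword
print(normalize_field_type("keyword"))                # keyword
```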

Format Usage Notes

Elasticsearch has two types of geometry fields, geo_point and geo_shape:
- geo_point fields can contain only point geometries.
- geo_shape fields can contain any geometry that is representable as GeoJSON.
You can write features from most coordinate systems, but they will all be reprojected to LL-WGS84 when being converted to GeoJSON. The coordinate reference system for all GeoJSON coordinates is a geographic coordinate reference system, using the World Geodetic System 1984 (WGS 84) [WGS84] datum. [Reference: The GeoJSON Format]
Writer: If a non-point geometry is written to a geo_point geometry field, then the geometry will be converted to its centroid point before writing.
Writer: Each Elasticsearch document has a unique Document ID. This ID can be specified on a feature with an attribute selected in the Writer Feature Type Parameters. If a document with that ID already exists, then the translation will fail.

Reader Overview

A single Elasticsearch reader can read multiple indices, but only from the same Elasticsearch cluster. A separate reader must therefore be created for each Elasticsearch cluster.

The feature types must be defined in the workspace before they can be read.

Multiple Geometry

The Elasticsearch reader supports reading multiple geometry fields from the same Elasticsearch feature type. If there is more than one geometry field in the Elasticsearch Mapping, then geometry will be read as FME Multiple Geometry. Each geometry part will be named after the corresponding Elasticsearch geometry field.
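
As a rough illustration of how geometry fields could be identified in a Mapping, the Python sketch below walks a mapping dictionary and collects every geo_point and geo_shape field by name. It assumes the typeless mapping layout used by Elasticsearch 7 and later (a "properties" object keyed by field name); the function name geometry_fields and the sample mapping are hypothetical, and this is not a description of FME's internal logic.

```python
GEOMETRY_TYPES = {"geo_point", "geo_shape"}


def geometry_fields(mapping, prefix=""):
    """Return {field path: geometry type} for a mapping dictionary."""
    found = {}
    for name, spec in mapping.get("properties", {}).items():
        path = prefix + name
        if spec.get("type") in GEOMETRY_TYPES:
            found[path] = spec["type"]
        # Object fields carry their own "properties" block, so recurse
        # and build a dotted path for any nested geometry field.
        found.update(geometry_fields(spec, path + "."))
    return found


# Hypothetical mapping with two geometry fields and one plain field.
sample_mapping = {
    "properties": {
        "name": {"type": "keyword"},
        "centre": {"type": "geo_point"},
        "boundary": {"type": "geo_shape"},
    }
}

print(geometry_fields(sample_mapping))
# {'centre': 'geo_point', 'boundary': 'geo_shape'}
```

With a mapping like this one, each of the two names would correspond to one named part of the FME Multiple Geometry.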

Writer Overview

The Elasticsearch writer stores documents in an Elasticsearch index. The Elasticsearch writer provides the following capabilities:

Index Creation

The Elasticsearch writer uses the information within the FME workspace to automatically create Elasticsearch indices as required. An index will be created when the first input feature is processed. If no features are sent to a feature type, then the corresponding index will not be created.

Each Index is created with a Mapping (schema) based on the feature type’s User Attributes. The fields of each JSON document that is written to the Index will be parsed according to that Mapping. If the document contains any fields that do not appear in the Mapping, then those fields will be automatically added to it. This can occur if the Document Source of the feature type is a JSON Attribute.
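
The relationship between User Attributes, the Mapping, and unmapped document fields can be sketched in Python as follows. The sketch assumes that attribute type names are used directly as Elasticsearch field types and that the index is created with a body of the form {"mappings": {"properties": ...}}; the function names and sample attributes are hypothetical.

```python
def build_index_body(user_attributes):
    """Build an index-creation body from {attribute name: type name}."""
    properties = {name: {"type": t} for name, t in user_attributes.items()}
    return {"mappings": {"properties": properties}}


def unmapped_fields(index_body, document):
    """List top-level document fields that the Mapping does not define.

    These are the fields that would be added to the Mapping
    automatically when the document is written.
    """
    mapped = index_body["mappings"]["properties"]
    return [name for name in document if name not in mapped]


# Hypothetical feature type with three User Attributes.
body = build_index_body(
    {"name": "keyword", "description": "text", "location": "geo_point"}
)

# A JSON document source that carries one extra field.
doc = {"name": "A", "population": 5}
print(unmapped_fields(body, doc))  # ['population']
```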

Multiple Geometry

The Elasticsearch writer supports writing to multiple geometry fields in the same Elasticsearch Mapping.

If there is more than one geometry field in the existing mapping, then a feature's geometry must have the same name as the destination Elasticsearch geometry field. Otherwise, no geometry will be written.

If a feature's FME Multiple Geometry has multiple parts, then each part can be written to a different Elasticsearch geometry field. Each part will be written to the Elasticsearch geometry field corresponding to its geometry part name, provided that the field exists.

Nested geometry fields can be created and/or written to by naming the geometry in the form:

<outer_name>.<inner_name>

For example, a geo_point geometry field called address.location would result in data similar to the following:

{
  "address": {
    "location": [ <lon>, <lat> ]
  }
}
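
The dotted naming convention can be illustrated with a short Python sketch that expands a geometry name into the nested document structure. The helper name nest_value and the sample coordinates are hypothetical; the sketch shows only the resulting document shape, not how the writer builds it.

```python
import json


def nest_value(field_path, value):
    """Wrap value in nested objects, one level per dotted name part."""
    doc = value
    for part in reversed(field_path.split(".")):
        doc = {part: doc}
    return doc


print(json.dumps(nest_value("address.location", [-123.1, 49.25])))
# {"address": {"location": [-123.1, 49.25]}}
```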
