FME Transformers: 2026.1

Categories
Rasters
Web

Web

Related Transformers
AzureAIVisionConnector
AzureTextAnalyticsConnector
ComprehendConnector
GoogleBigQueryConnector
GoogleCloudPubSubConnector
GoogleCloudStorageConnector
GoogleDriveConnector
GoogleLanguageConnector
RekognitionConnector

GoogleVisionConnector

Connects to the Google Vision AI API for object detection in images.

Jump to Configuration

Typical Uses

  • Detecting labels, objects, faces, and text

How does it work?

The GoogleVisionConnector uses your Google Cloud account credentials to connect to Google Vision and submit images for analysis.

Images may be provided as local files, Google Storage objects, URLs, or as raster geometry, and each one may produce multiple output features.

Services supported are:

Detection Type Analysis

Document Text Detection

Detects and extracts text from an image.

It is optimized for dense text and documents, such as an image of a handwritten document with blocks, paragraphs, words and symbols.

Results include the entire extracted strings for blocks and paragraphs, as well as individual words and symbols.

Face Detection

Successfully identified faces will result in output features with attributes describing the face.

Each feature will have a bounding box for the face.Facial landmarks such as LEFT_EYE, NOSE_TIP, or LEFT_EYE_PUPIL are added as additional point geometries on the feature.

When using any input image source, the bounding box is in pixel units, and will align with the input.

Object Detection

Detects and extract information about objects in an image, across a broad group of categories.

Labels can identify general objects, locations, activities, animal species, products, and more.

Detected objects will have a bounding box geometry returned. Bounding boxes are in pixel units, and will align with raster and local file inputs. Bounded boxes for URL inputs are returned as normalized values between 0 and 1.

Text Detection

Detects and extracts text from any image.

Results include the entire extracted string, as well as individual words and their bounding boxes.

When using any input image source, the bounding box is in pixel units, and will align with the input.

Optional Input Port

This transformer has two modes, depending on whether a connector is attached to the Input port or not:

  • Input-driven: When input features are connected, the transformer runs once for each feature it receives in the Input port.
  • Run Once: When no input features are connected, the transformer runs one time.

When the Input port is in use, the Initiator output port is also enabled.

Configuration

Input Ports

Output Ports

Parameters

Editing Transformer Parameters

Transformer parameters can be set by directly entering values, using expressions, or referencing other elements in the workspace such as attribute values or user parameters. Various editors and context menus are available to assist. To see what is available, click beside the applicable parameter.

For more information, see Transformer Parameter Menu Options.

Reference

Processing Behavior

Feature-Based

Feature Holding

No

Dependencies Google Cloud Account with access to the Cloud Vision API
Aliases  
History Released FME 2019.2

FME Online Resources

The FME Community and Support Center Knowledge Base have a wealth of information, including active forums with 35,000+ members and thousands of articles.

Search for all results about the GoogleVisionConnector on the FME Community.

 

Examples may contain information licensed under the Open Government Licence – Vancouver, Open Government Licence - British Columbia, and/or Open Government Licence – Canada.