CSV (Comma-Separated Value) Reader/Writer
- Reader Parameters

You are here: Popular Formats > CSV (Comma-Separated Value) Reader/Writer > Reader Parameters

CSV (Comma Separated Value) Reader Parameters

Dataset Parameters

Feature Type Name(s)

This parameter allows you to choose different naming schemes, and the number of feature types generated for the reader.

Feature type name choices:

From File Name(s): Generates one feature type per source filename.
From Format Name: Produces only a single feature type containing the format name.

Fields

Delimiter Character

The single character specified as the delimiter between values.

Field Names Line

The line number that contains the field names. Note that the first line in the file is considered to be line number 1. If the file does not contain field names, leave this blank.

When the file does not contain field names, the columns of the CSV table are given default names (for example, col0, col1, ..., colN).

Data Start Line

The line number at which the data starts. Note that the first line in the file is considered to be line number 1.

Advanced

Field Separation

Skip Duplicate Delimiters

If selected, multiple contiguous delimiters are treated as a single delimiter; otherwise, each delimiter is treated as if it delimits a different field.

Field Qualifier Character

Specifies the character that encloses field values. When a field starts with this character, all text that follows this character and precedes the next occurrence of the character will be treated as one value, even if that text contains a delimiter or newline character.

For example, if the delimiter is a comma (,) and the field qualifier is a quotation mark ("), then the value

"Vancouver, BC"

will be treated as one value

Vancouver, BC

rather than two separate values

Vancouver

Field Qualifier Escape Character

Specifies the character that escapes the field qualifier character. This is used when wanting to have a field qualifier literal within a field qualifier group.

For example, if the field qualifier character is a quotation mark (") and the escape character is a backslash (\), then the value

"Vancouver \"Lotusland\", BC"

will be read as

Vancouver "Lotusland", BC

Field Names

Case-Sensitive Field Names

Specifies whether the field names should be matched against the schema in a case-sensitive or case-insensitive manner.

For example, suppose the schema contains the attribute "MyField" but the file contains the field "myfield". If field names are case-sensitive, these are considered to not match, and the attribute "MyField" will not be read. On the other hand, if field names are not case-sensitive, these are considered to match, and values from the "myfield" column will be read for attribute "MyField".

Enforce Strict Schema

Specifies whether to enforce a strict schema.

If this parameter is set to Yes and the fields in the file do not match the attributes on the schema in FME, the reader will fail.
If this parameter is set to No, the reader will warn about any attributes that exist on the schema but are not present in the file, and will continue reading.

Field Values

Trim Field Qualifier Characters

Specifies whether to trim the field qualifier character from values. Note that these characters are only trimmed when they serve as field qualifiers (that is, the first character in the value is this character, until the next instance of that character).

For example, if the field qualifier is a quotation mark ("), then the value

"Vancouver, BC" and "More"

will be read as

Vancouver, BC and "More"

Encoding

Character Encoding

Specifies the file encoding to use when reading.

Preserve Source Encoding

Specifies whether string attributes will be set in the file encoding.

Yes: String attributes will always be in the encoding of the file.
No: String attributes may be in the file encoding, but may also be in a Unicode encoding. Setting this parameter to No may improve performance when reading from an encoded file.

Skipped Lines

Read Skipped Lines

Specifies whether to read lines from the file that occur before the data start line. (Note: The field name's line is never read as a feature.)

If set to Yes, the reader will produce features for these lines, where the attribute csv_skipped_lineis set to the content of that line.

Schema Generation

Maximum Rows to Scan

If the field structure of the first several rows of a file is representative of the remainder of the file, this option can be set to prevent FME from unnecessarily reading further rows from a potentially large file when determining its schema.

If left blank, there will be no limit and all rows will be read.

Note: This setting only applies to the schema generation; it does not limit the number of rows read when the translation is run.

Scan for Types

Specifies whether to try to determine the types of attributes when scanning for schema.

No: All attributes will be treated as strings.
Yes: FME will attempt to determine the correct type for each attribute (for example, int32, real64, etc.).

Using properly typed attributes may improve reading and processing performance. However, if an attribute value is not valid for a scanned type (for example, because the value was not included when scanning for schema), it will be set to null.

When scanning for types, FME will also attempt to automatically map fields to coordinates (for example, a field named x will be given a type of x_coordinate).

Scan for Additional Fields

Specifies whether to scan for additional fields, beyond those found in the field names row.

Yes: FME will attempt to find additional fields that aren’t included in the field names row.
No: The field names row is assumed to contain all fields in the file.

This option has no effect when the file does not contain field names.

Preview

Shows a preview of the input CSV dataset, as read with the current options.

Attributes

Shows the schema of the dataset, as read with the current options:

Column	Description
Read	Whether to read this field as attribute.
Name	The name of the attribute.
Type	The type of the attribute.

Schema Attributes

Additional Attributes to Expose

Use this parameter to expose Format Attributes in Workbench when you create a workspace:

In a dynamic scenario, it means these attributes can be passed to the output dataset at runtime.
In a non-dynamic scenario where you have multiple feature types, it is convenient to expose additional attributes using this one parameter. For example, if you have ten feature types and want to expose the same attribute in each one, it is easier to define it once than it is to set each feature type individually in the workspace.