General Tab

The fields under the Parse Copybook Data activity General tab are described here.

Field Literal Value/Module Property? Description
Name No The activity’s label name in the process definition.
Copybook Resource No The Copybook Resource that defines the schema for parsing incoming data into XML.
Big Endian Yes The byte order of the incoming data. Check this field to specify the byte order as Big Endian, for example, for IBM mainframes and certain UNIX platforms. Uncheck this field if the byte order is Little Endian, for example, for Microsoft Windows and certain UNIX platforms.
Character Set Yes The character-set encoding for the string values in the incoming data except for COBOL items with DISPLAY-1, also affecting the parsing of zoned decimal values. For details, see Handling of Zoned Decimal Items.

You can choose either single-byte or multibyte character sets from the selector dialog. Multibyte character sets are marked with an asterisk for easy identification.

Warning: Multibyte character sets are allowed for COBOL copybooks only. Although you can choose those sets, choosing one in conjunction with a PL/I or RPG copybook causes an error.

For details on how character sets work in the Plug-in, see Latin Normalization in Character Sets.

Character Set for DISPLAY-1 Items Yes The character-set encoding for the string values in the incoming data for COBOL items with USAGE DISPLAY-1. This field is available only if you select a COBOL copybook.

You can choose either single-byte or multibyte character sets from the selector dialog. Multibyte character sets are marked with an asterisk for easy identification.

For details on how character sets work in the Plug-in, see Latin Normalization in Character Sets.

Multiple Records Yes The field that instructs the activity to repeat the copybook parsing until the end of the input data is reached and to generate multiple output elements.
Records Delimiter Yes The field that specifies the delimiter between records in the input data in case of a checked Multiple Records field. You can specify the following valid delimiter types:

None — Specifies that a new record begins directly after the last record ends. No characters separate the records.

New Line — Specifies that a new-line character separates the records in the input.

Carriage Return — Specifies that a carriage-return character separates the records in the input.

Carriage Return/Line Feed (Windows) — Specifies that a carriage-return character followed by a line-feed character separates the records in the input.

When specifying this field by Module or Process properties, you can use the following values:

  • None: none
  • New Line: nl, New Line
  • Carriage Return: cret, cr, Carriage Return
  • Carriage Return/Line Feed (Windows): crlf, Carriage Return/Line Feed (Windows)

Note that the previous values are not case-sensitive.

Trim Whitespaces from 
String Values Yes The field that specifies whether to trim white spaces of the text-converted values after parsing. Here are the white-space characters and their descriptions:

\u0009 – Horizontal tabulation
\u000A – Line feed
\u000B – Vertical tabulation
\u000C – Form feed
\u000D – Carriage return
\u001C – File separator
\u001D – Group separator
\u001E – Record separator
\u001F – Unit separator
\u0000 – Null character

Note: Internally, the Plug-in calls a trim() method of the Java class String.

This field also affects how the Plug-in matches the REDEFINE control field values. For details, see Configuration of Data-Description Entries for a REDEFINES group Or Union.

Empty Nodes Yes The field that specifies whether to preserve empty elements in the output XML. If you uncheck this field, the Plug-in deletes the empty elements from the XML data, thus reducing the size of the XML output.
Input Type No The field that specifies the type of input for this activity. The type can be either of the following:
  • A byte array, in which case you must specify a base64Binary value for the bytes input field.
  • A file, in which case you must specify the file name and location for the fileName input field.
Return Remaining Bytes No Enable returning of the remainder of the input base64Binary. If selected, the output of the activity contains a field remainingBytes. This field contains the tailing portion of the input base64Binary that has not been processed by this activity. See Connecting Multiple Parse Activities.
Allow Incomplete Records Yes This field specifies whether the activity requires that the input contains enough input data for an entire copybook layout (record) or incomplete layouts are tolerated. This field is only available for COBOL copybooks.

See Parsing Incomplete Records for more details.

Allow Incomplete Text Items Yes This field specifies that the last text item is allowed to have less bytes than the copybook implies. This field is only available for COBOL copybooks and it is enabled only if Allow Incomplete Records is selected. See Parsing Incomplete Records for more details.
Ignore Invalid Items Yes This field turns off errors resulting from invalid content in the binary data in an elementary item. This field is only available for COBOL copybooks.

See Validation of Elementary Items.

Check String Values for Invalid XML Characters Yes

This field specifies whether, during parsing, the activity checks for invalid XML characters in XML output. Invalid XML characters can occur depending on binary content and character set selections.

If both the Check String Values for Invalid XML Characters field and the Ignore Invalid Items field are set to Yes, an invalid XML character will not cause an activity error, but the character is not included in the activity output.

If only the Check String Values for Invalid XML Characters field is set to Yes, and the Ignore Invalid Items field is set to No, the activity fails.

If the Check String Values for Invalid XML Characters field is not set to Yes, the activity does not check for invalid XML characters, regardless of the value of the Ignore Invalid Items field. Not checking for invalid characters might cause incremental performance improvements, but this setting should only be used when data is guaranteed to be XML compliant.

See Dealing with Invalid XML Characters.