Dataset

This is a source operator that connects a database table or view. With this operator, you can incorporate data into the workflow.

Dataset

Information at a Glance

Note: This operator can only be used with TIBCO® Data Virtualization and Apache Spark 3.2 or later.

Parameter

Description
Category Load Data
Data source type TIBCO® Data Virtualization
Send output to other operators Yes
Data processing tool TIBCO® DV, Apache Spark 3.2 or later

The database table can subsequently be used in the following projects:

  • A data-mining algorithm
  • A prediction algorithm
  • A statistical analysis

Input

A Dataset operator does not take any input, since it is a source operator.

Configuration

The following table provides the configuration details for the Dataset operator.

Parameter Description
Notes Notes or helpful information about this operator's parameter settings. When you enter content in the Notes field, a yellow asterisk appears on the operator.
Data Source Specify the database connection for access to the database where the data set (Table or View) resides. The data source list is populated with the database data sources associated with the current workflow.
Schema Name Specify the schema name of the data set (Table or View). The list of schemas is populated with the schemes found in the selected Data Source.

Table Name Specify the name of the data set (Table or View). The list of tables is populated with the tables found in the selected Schema Name.

Output

Visual Output
None.
Data Output
A data set that corresponds to the specified database table or view.

Example

The following example illustrates the Dataset operator that connects a database table from the TIBCO Data Virtualization data source. This data source is incorporated into the workflow that the downstream operators use.

Dataset operator workflow.png
Data
None.
Parameter Setting
The parameter settings for the Dataset operator are as follows:
  • Data Source: TDV

  • Schema Name: Datasets_S3

  • Table Name: golfnew.csv

Output
The following figure displays the output for the parameter settings for the Dataset operator.
Dataset operator output - golfnew