Importing Hadoop Datasets

Follow this procedure to import a Hadoop dataset into a workspace's sandbox.

    Procedure
  1. Browse a Hadoop data source to obtain a list of directories and files.
  2. Select the CSV file, and from the right pane, click Create as an External Table.
  3. From the Select workspace dropdown menu, browse a list of workspaces for which you are a member.
  4. Only workspaces with sandboxes are displayed.
  5. In the Table name box, choose a table name for your import.
    Be sure that you use a table name that is valid for your database provider.
    TIBCO Data Science - Team Studio attempts to determine which delimiter your CSV file uses. If you use a non-standard delimiter or this determination is wrong, use the Delimiter command to choose a new delimiter.

    A preview of the data in tabular format is displayed. Verify that this format is correct, and then click Create External Table.

Result The new external table is created in the sandbox schema of the workspace you choose.