Importing Hadoop Datasets
Follow this procedure to import a Hadoop dataset into a workspace's sandbox.
- Procedure
- Browse a Hadoop data source to obtain a list of directories and files.
- Select the CSV file, and from the right pane, click
Create as an External Table.
- From the Select workspace dropdown menu, browse a list of workspaces for which you are a member.
- In the
Table name box, choose a table name for your import.
Be sure that you use a table name that is valid for your database provider.TIBCO Data Science - Team Studio attempts to determine which delimiter your CSV file uses. If you use a non-standard delimiter or this determination is wrong, use the Delimiter command to choose a new delimiter.
A preview of the data in tabular format is displayed. Verify that this format is correct, and then click Create External Table.
Only workspaces with sandboxes are displayed.
Result The new external table is created in the sandbox schema of the workspace you choose.