Connect to a YARN-Enabled Data Source

To connect to a YARN-enabled cluster, Team Studio must have access to the following ports on each node of the cluster:

Prerequisites

Port Configuration Parameter
8020 fs.default.name/dfs.namenode.rpc-address.<nameservice>.namenode<x>
8030 yarn.resourcemanager.scheduler.address
8031 yarn.resourcemanager.resource-tracker.address
8032 yarn.resourcemanager.address
8088 yarn.resourcemanager.webapp.address
10020 mapreduce.jobhistory.address
50010 dfs.datanode.address
50020 dfs.datanode.ipc.address
user-specified (or blank) yarn.app.mapreduce.job.client.port-range
Note: This parameter is unspecified on many clusters, meaning that Team Studio chooses an arbitrary open port when it runs MapReduce jobs. Instead, you can set this parameter to a specific port range either in the Team Studio data source configuration or on the cluster.

For more options, see Team Studio Default Ports.