Creating an HDFS Connection

You can create an HDFS Connection shared resource if you want to build a connection between the plug-in and HDFS.

Prerequisites

The HDFS Connection shared resource is available at the Resources level. Ensure that you have created a project, as described in Creating a Project.

Procedure

  1. Expand the created project in the Project Explorer view.
  2. Right-click the Resources folder and click New > HDFS Connection.
  3. The resource folder, package name, and resource name of the shared resource are provided by default. You can change these default configurations accordingly. Click Finish to open the HDFS Connection shared resource editor.
  4. In the HDFS Url field, specify the WebHDFS URL or the HttpFS URL that is used to connect to HDFS.
  5. In the User Name field, specify the unique user name that is used to connect to HDFS.
  6. Click Test Connection to validate the connection.
  7. Optional: If you want to connect to a Kerberized HDFS server, select the Enable Kerberos check box, and then set up a connection to the Kerberized HDFS server.
    Note: If your server uses the AES-256 encryption, you must install Java Cryptography Extension (JCE) Unlimited Strength Jurisdiction Policy Files on your machine. For more details, see Installing JCE Policy Files.
    1. From the Kerberos Method list, select a Kerberos authentication method:
      • Keytab: specify a keytab file to authorize access to HDFS.
      • Cached: use the cached credentials to authorize access to HDFS.
      • Password: specify a password for the Kerberos principal.
    2. In the Kerberos Principal field, specify the Kerberos principal name that is used to connect to HDFS.
    3. If you select Keytab as the authentication method, specify the keytab that is used to connect to HDFS in the Kerberos Keytab field.
    4. Click Test Connection to validate the connection.