Prerequisites for Big Data Import

Before uploading huge data using the Big Data Import approval option, you must first perform the following actions:

  • Install and configure Apache Spark and Apache Hadoop Distributed File System (HDFS). For configuration, see the "Setting Up Apache Spark" and "Setting Up Hadoop Distributed File System" sections in TIBCO MDM Installation and Configuration Guide.
  • Understand the Apache Spark and HDFS architecture for big data import processing. For information, see Big Data Import Processing with Apache Spark and HDFS.
  • Create an input map using the data source in TIBCO MDM Studio. For information, see TIBCO MDM Studio Repository Designer User’s Guide. Additionally, see Approval Options.
    Note: Input maps are supported based on only one data source.