Setting up Apache Spark
To use the Big Data Import feature of TIBCO MDM, you must download and configure Apache Spark. An Apache Spark cluster consists of a single master and any number of worker nodes. For efficient performance, configure four or five worker nodes.
Prerequisites
- For the recommended platform, see Platform Limitations for Apache Spark.
- Share the $MQ_HOME, $MQ_COMMON_DIR, and MQ_CONFIG_FILE directories across all worker nodes (from the TIBCO MDM host machine to the Apache Spark master and worker machines).
For information about sharing the directory in the cluster environment, see Clustering Set Up.
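Before continuing, it can help to confirm on each Spark master and worker machine that the shared locations are actually visible. The following is a minimal sketch, not part of the product: it assumes the three names from the prerequisites are exported as environment variables on every node, and the function name find_unshared_paths is hypothetical.

```python
import os
from pathlib import Path

# Names taken from the prerequisites above. Treating all three as
# environment variables set on every node is an assumption.
REQUIRED_VARS = ("MQ_HOME", "MQ_COMMON_DIR", "MQ_CONFIG_FILE")


def find_unshared_paths(env=None):
    """Return {variable: reason} for each variable that is unset or
    whose path does not exist on the machine running this check."""
    env = os.environ if env is None else env
    problems = {}
    for name in REQUIRED_VARS:
        value = env.get(name)
        if not value:
            problems[name] = "not set"
        elif not Path(value).exists():
            problems[name] = f"path not found: {value}"
    return problems
```

Calling find_unshared_paths() on a node returns an empty dictionary when all three locations resolve there. Any entry it returns names a variable to review against the Clustering Set Up instructions. The check only confirms that a path exists, not that it points to the same shared storage as on the TIBCO MDM host.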
Procedure
Copyright © Cloud Software Group, Inc. All rights reserved.