Configure SparkR to use TIBCO Enterprise Runtime for R

The SparkR package is an R package that provides a front end for using the Apache™ Spark system for distributed computation. SparkR allows using R to invoke Spark jobs, which can then call R to perform computations on distributed worker nodes.

You can modify the SparkR source to call the TIBCO® Enterprise Runtime for R (TERR) engine rather than the R engine by following the instructions contained in this technical note.

Note: To use TERR with SparkR, you must be able to perform the following tasks.
  • Install Hadoop™ (with Yarn) and Spark.
  • Install open-source R.
  • Download the SparkR package source.
  • Modify and build the SparkR package.
  • Install TERR, version 3.2 or later, and link it to your modified SparkR package.