AWS EMR

These lists provide information and links for operators in Team Studio that you can use with a AWS EMR data source.

Data Extraction Operators
Copy To Database
Hadoop File
Load to Hive
Exploration Operators
Bar Chart
Box Plot
Correlation
Frequency
Histogram
Line Chart
Scatter Plot Matrix
Summary Statistics
Variable Selection
Transformation Operators
Aggregation
Batch Aggregation
Collapse
Column Filter
Correlation Filter - Hadoop
Distinct - Hadoop
Fuzzy Join
Join
Normalization
Null Value Replacement
Numeric to Text
One-Hot Encoding
Pivot
Reorder Columns
Replace Outliers
Row Filter
Sessionization
Set Operations
Sort By Multiple Columns
Transpose
Unpivot
Unstack
Variable
Wide Data Variable Selector - Chi Square /Anova
Wide Data Variable Selector - Correlations
Window Functions - Aggregate
Window Functions - Lag/Lead
Window Functions - Rank
Sampling Operators
Random Sampling
Resampling
Sample Selector
Modeling Operators
Alpine Forest Classification
Alpine Forest Regression
ARIMA Time Series
Association Rules
Collaborative Filter Trainer
Decision Tree - Hadoop
Gradient Boosting Classification
Gradient Boosting Regression
K-Means
Linear Regression
Logistic Regression
Naive Bayes
Neural Network
PCA
SVM Classification
Prediction Operators
Chi Square, Goodness of Fit
Chi Square, Independence Test
Classifier
Collaborative Filter Predictor
Collaborative Filter Recommender
N-gram Dictionary Loader
PCA Apply 1
Predictor
Model Validation Operators
Alpine Forest Evaluator
Classification Threshold Metrics
Confusion Matrix
Goodness of Fit
Lift
Regression Evaluator
ROC
T-Test Independent Samples
T-Test Paired Samples
T-Test Single Sample
Tool Operators
Convert
Export
Export to Excel
Export to SBDF
Flow Control
HQL Execute
Load Model
Note
Pig Execute
Python Execute
R Execute
Sub-Flow
Natural Language Processing (NLP) Operators
LDA Predictor
LDA Trainer
N-gram Dictionary Builder
Text Extractor
Text Featurizer
1 This operator is deprecated but not yet removed or replaced.