Glossary
TIBCO® Data Science Team Studio defines certain product-related terms and uses them in the user interface and in the documentation. Understanding the meaning of these terms can help you to work in Team Studio.
C
- comment
- People can comment on anything in the Activity pane in the workspace overview.
- custom operator
- One can integrate their own algorithms and processes into Team Studio analytics engine using the Custom Operator Framework. Custom Operators are written using Scala or Java and can harness the power of Spark for advanced machine learning and transformations.
D
- dataset
- Datasets come from databases, HDFS, or uploaded files such as CSVs. Datasets are part of a workspace, and they are used within workflows, Touchpoints, and sandboxes in order to perform analyses.
- data source
- An external data provider, either a Hadoop database or a relational database.
- design time
- The actions a person takes before running an operator or workflow. He or she is "designing"/customizing the operator's parameters. This is an important concept to understand when learning about custom operators.
M
- milestone
- A section of work that a team member or group of members works on. Milestones include a due date that enables people to see at a glance the progress of a particular analytic project. Milestones are shown on the workspace page under the Milestones tab, and also on the workspace overview. One can also change the status of the project to one of the three available options: On Track, Needs Attention, or At Risk.
N
- note
- People can make notes on a workspace or any workfile within that space. Notes will show under the Activity pane in the workspace overview and will be viewable to those with access to the workspace. Notes can be promoted to Insights and commented on by other people.
- notification
- Notifications alert people of important changes in the application, such as job results or collaboration information. When someone adds another person to the workspace, that person gets a notification. Many types of activities can have notifications, which can be configured individually.
O
- operator
- An operator encapsulates some algorithm or transformation within a workflow. Operators show up as a list on the sidebar of the application within the workflow editor, and can be dragged to the canvas and connected to other operators or data sources. They are one of the main building blocks for workflows (the other being data sources). People can filter the operators based on the intended operation such as Load, Explore, Transform, Model, Predict, and Tools.
S
- source operator
- An operator that starts a workflow. Contrast with terminal operator.
- shared account
- An account that can be used to provide access to a database data source for multiple members of an organization. This means that people will share a single set of credentials for the data source.
- sandbox
- A place on the Data tab where people can bring in their training schema and perform simple explorations with the help of visualizations. One can also create external views from the sandbox datasets. Sandboxes are only available for database data sources.
T
- tag
- Tags can be used to categorize datasets or results within the application. Tags can be added to any work file. Selecting a tag brings up a view of all work files with that tag.
- Team Studio analytics model
- A model that is generated from a workflow and is saved as an .am file. This allows the portability of models from one workflow to the other without having to build the whole model again. The Alpine models are saved under the Work Files tab of a workspace.
- terminal operator
- An operator that no other operator can directly follow in the workflow. Contrast with source operator.
- Touchpoints
- Touchpoints wrap the functionality of complex workflows in an interactive application that can be consumed by business owners and people who are not data scientists.
W
- workflow
- A collection of datasets and operators that perform analytic tasks. A workflow is where data scientists build out models using the available operators and algorithms in Team Studio
- workflow variable
- People can override default parameters and define their own workflow-wide variables using a workflow variable. Workflow variables can be used to set default paths and parameters for HDFS, configure operator parameters, and enable input via Touchpoints.
- workspace
- Workspaces allow team members to collaborate on a data science project. Workspaces hold work files and can also have scheduled jobs, milestones, and associated database and Hadoop datasets under the Data tab to keep track of progress on a project. A workspace can be either public or private. People can create a workspace from the workspace page.
- workfile
- Any file that is within the Work Files section within the workspace. Work files can be workflows, SQL files, CSVs, Touchpoints, Alpine models , or result files saved to the workspace. In addition, people can upload files (such as images or .zip files) to make them part of the workspace.
Copyright © Cloud Software Group, Inc. All rights reserved.