This adapter sample illustrates the use of the TIBCO StreamBase® Text File Reader Adapter for Apache Hadoop Distributed File System (HDFS) by reading a file and emitting a tuple that contains the file's contents in a string field.
The MyFile.txt file used in this sample must be placed on your HDFS file system before this sample can run.
You must also open the filereader.sbapp file, select the Parameters tab, and edit the HDFS_FILE_PATH and HDFS_USER values to match your HDFS setup and to give the sample access to the required files. The .sbapp file is located in → →
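One way to place MyFile.txt on HDFS is with the standard `hdfs dfs` command-line client. The following is a minimal sketch, not part of the sample itself: the target directory `/user/sbuser` and the user name `sbuser` are hypothetical placeholders, and it assumes a cluster with simple (non-Kerberos) authentication, where the Hadoop client reads the acting user from the `HADOOP_USER_NAME` environment variable. The values you choose here would presumably correspond to what you enter for HDFS_FILE_PATH and HDFS_USER.

```python
import os
import shlex
import subprocess

# Hypothetical values: replace with the directory and user of your own HDFS setup.
HDFS_DIR = "/user/sbuser"
HDFS_USER = "sbuser"
LOCAL_FILE = "MyFile.txt"


def build_upload_commands(local_file, hdfs_dir):
    """Return the hdfs CLI invocations that create the target directory,
    upload the file, and list it to confirm the upload."""
    target = f"{hdfs_dir.rstrip('/')}/{os.path.basename(local_file)}"
    return [
        ["hdfs", "dfs", "-mkdir", "-p", hdfs_dir],
        ["hdfs", "dfs", "-put", "-f", local_file, hdfs_dir],
        ["hdfs", "dfs", "-ls", target],
    ]


def run_commands(commands, hdfs_user):
    """Run each command in order, stopping at the first failure."""
    # Assumption: simple authentication, so HADOOP_USER_NAME selects the user.
    env = dict(os.environ, HADOOP_USER_NAME=hdfs_user)
    for cmd in commands:
        subprocess.run(cmd, check=True, env=env)


# Print the commands for review; nothing is executed against HDFS here.
for cmd in build_upload_commands(LOCAL_FILE, HDFS_DIR):
    print(shlex.join(cmd))
```

To execute the commands rather than print them, call `run_commands(build_upload_commands(LOCAL_FILE, HDFS_DIR), HDFS_USER)` on a machine where the `hdfs` client is installed and configured for your cluster.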
Run this sample in StreamBase Studio with the following steps:
- In the Project Explorer, open the sample you just loaded.
- Open the src/main/eventflow folder.
- Open the package folder. Most samples contain a single package folder; if your sample contains more than one folder, open the top-level package folder.
- Open the named application file and click the Run button. This opens the SB Test/Debug perspective and starts the application. If you see red marks, wait a moment for the project in Studio to load its features. If the red marks do not resolve themselves in a moment, select the project and right-click → from the context menu.
- In the Manual Input view, click to send the default null tuple.
- In the Output Streams view, observe tuples emitted on the Status and Data output streams, the latter of which contains the contents of the configured default file, MyFile.txt.
- Press F9 or click the Stop Running Application button.
In StreamBase Studio, import this sample with the following steps:
- From the top-level menu, select → .
- Type hdfs to narrow the list of options.
- Select hdfsfilereader from the Large Data Storage and Analysis category.
- Click OK.
StreamBase Studio creates a project for this sample.
When you load the sample into StreamBase Studio, Studio copies the sample project's files to your Studio workspace, which is normally part of your home directory, with full access rights.
Important
Load this sample in StreamBase Studio, and thereafter use the Studio workspace copy of the sample to run and test it, even when running from the command prompt.
Using the workspace copy of the sample avoids permission problems. The default workspace location for this sample is:
studio-workspace/sample_adapter_embedded_hdfsfilereader
See Default Installation Directories for the default location of studio-workspace on your system.
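If you work from the command prompt, it can help to confirm where the workspace copy of the application file lives. The sketch below is illustrative only: it assumes the workspace copy keeps the src/main/eventflow layout seen in Studio, and the workspace path `~/studio-workspace` is a hypothetical placeholder for the actual studio-workspace location on your system.

```python
from pathlib import Path

# Sample directory name taken from the default workspace location above.
SAMPLE_DIR = "sample_adapter_embedded_hdfsfilereader"


def find_app_files(workspace, app_name="filereader.sbapp"):
    """Return every copy of app_name under the sample's src/main/eventflow
    folder, searching all package folders beneath it."""
    eventflow = Path(workspace) / SAMPLE_DIR / "src" / "main" / "eventflow"
    if not eventflow.is_dir():
        return []
    return sorted(eventflow.rglob(app_name))


# Hypothetical workspace location; substitute your own studio-workspace path.
workspace = Path.home() / "studio-workspace"
for path in find_app_files(workspace):
    print(path)
```

An empty result means either that the sample has not been loaded into that workspace yet or that the workspace path is different on your system.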