Use this connection to communicate between Apache Hbase® and Minitab Connect.

Set up a new connection

Complete the following steps to set up a connection in Minitab Connect.
  1. From the Home screen, select the Add button under Tables.
  2. In Datasets, enter Hbase and select the connector from the results.
  3. Enter a Name that identifies the connection. Under Beacon, specify which beacon you want the connection to go through if you have multiple beacons. Usually, the default works best.
  4. Under Setup [1 of 2], enter a Hostname and Port number for the connection.
  5. Enter a Username and Password if your account requires one. Then select Next.
  6. Minitab Connect displays a message whether the connection is successful or whether there is an error with the connection. If the connection is successful, select Save .

After you save the connection, Connect displays options to import data into a new table. Start at step 4 of the following section for more information on importing data to create a table.

Create a table with data from a connection

Complete the following steps to import data from Hbase and create a new table in Connect.
  1. From the Home screen, select the Add button under Tables.
  2. In Datasets, enter Hbase and select the connector from the results.
  3. Under Connection, select the Hbase connection that you previously set up. If you have not previously set up a connection, select New Connection and follow the steps in the previous section.
  4. On the left panel, enter a Name that identifies the table and select a Folder to save the table.
  5. Under Update Frequency, specify how often you want Connect to update the table. You can import the data one time or have Connect continuously import the data at a set time interval. Connect automatically creates a flow for your import. If you select Once, you can use the flow at a later time to automatically run the import again.
  6. Under Setup [1 of 4], select the Table that contains the data.
  7. In Limit, select how you want to limit the number of pulls. Select No if you don't any limits.
    • Select Yes: Last X Days to pull values from only the most recent days. In Days, enter the number of days.
    • Select Yes: Choose Dates to pull values within a certain time range. Enter the dates for the range in Start Time and End Time.
  8. In Batch Size, enter the maximum number of records you want to pull for each batch. Usually, the default value works best.
  9. In Batch Limit, enter the maximum number batches you want to pull. Leave this value at 0 to pull at batches. Select Next.
  10. In Column Families, enter text to filter the list. Then select the fields that you want to import and select Next.
  11. In Pull specific columns from field, enter a list of column names. Separate multiple names with a comma. If your column names contain commas, enter a different character to use as a separator in Alternate separator. Connect only pulls data from columns that match the names you enter. Leave the box empty to select all columns. Select Next.
  12. From Refresh Method, select how you want the table to refresh.
    Replace
    Replace existing records with imported records.
    Append
    Add all imported data to the existing data.
    Append New
    Add only new imported records to existing data.
    Update
    Add new imported records and update existing data.
  13. Select Save . If you go to the Prep Tool and select Run , Minitab Connect displays the imported data.
    Note

    If no data appears after you select Run, select Reset Config .

When you save the import, Connect creates a flow with the settings that you selected. For more information on how to schedule a flow and add more data processes to clean data, go to Overview of the Flow Tool .

Export a file

Complete the following steps to export data from a Minitab Connect table.
  1. Open the Outputs tab of the Flow Tool .
  2. Select the plus sign beside Export to add a new export.
  3. Under Export, select New Export. If you select One Time Download, Connect downloads the file to your computer and does not create an export.
  4. Enter a Name for the export.
  5. Under View, select a saved view of the table. If you select None, Connect exports all the data from the table. For more information on saved views, go to Example of creating a data view.
  6. Under Delivery, select Connection.
  7. Under Connection, select the connection that you previously set up.
  8. You can use a colon in your column names to indicate column families. For example, an export with a column named foo:bar will be stored in the 'bar' column of the 'foo' column family. Connect stores all columns without a 'colfam:' prefix in the column you enter in Default Column Family.
  9. In Row Key, enter the name of the field in your export that you want to use as the Row Key. A Row Key identifies a unique record in HBase. If you do not enter a name or Connect cannot find the name that you enter, Connect uses the first field in the table as the key.
  10. Select Yes in Include Row Key In Data to have the Row Key field appear in a column family. If you select No, Connect uses the field only as the Row Key.

You can specify the file type and file name along with other options. Select Save to add the export to the list of exports in the Outputs tab. Select Run to run the export.