Use this connection to communicate between Apache HiveTM and Minitab Connect.

Set up a new connection

Complete the following steps to set up a connection in Minitab Connect.
  1. From the Home screen, select the Add button under Tables.
  2. In Datasets, enter Hive and select the connector from the results.
  3. Enter a Name that identifies the connection. Under Beacon, specify which beacon you want the connection to go through if you have multiple beacons. Usually, the default works best.
  4. Under Setup [1 of 2], enter a Host and Port number for the connection. Then enter a Username and Password for your account. Then select Next.
  5. Select the Database and the Warehouse from which you want to pull data. Then select Next.
  6. Minitab Connect displays a message whether the connection is successful or whether there is an error with the connection. If the connection is successful, select Save .

After you save the connection, Connect displays options to import data into a new table. Start at step 4 of the following section for more information on importing data to create a table.

Create a table with data from a connection

Complete the following steps to import data from Hive and create a new table in Connect.
  1. From the Home screen, select the Add button under Tables.
  2. In Datasets, enter Hive and select the connector from the results.
  3. Under Connection, select the connection that you previously set up. If you have not previously set up a connection, select New Connection and follow the steps in the previous section.
  4. On the left panel, enter a Name that identifies the table and select a Folder to save the table.
  5. Under Update Frequency, specify how often you want Connect to update the table. You can import the data one time or have Connect continuously import the data at a set time interval. Connect automatically creates a flow for your import. If you select Once, you can use the flow at a later time to automatically run the import again.
  6. Under Setup [1 of 6], select the Database that contains the data. Under Skip to SQL, select Yes to skip the import options and enter the SQL statement manually. Select No to use the Minitab Connect options to create the SQL statement. Then select Next.
  7. Select the Table that you want to import. Then select Next.
  8. In Fields, enter text to filter the list. Then select the table fields that you want to import and select Next.
  9. In Limit, enter a value to limit the number of records that you want to pull. Leave this field blank to pull all records. Then select Next.
  10. In Query, Connect displays the query that runs on the HiveServer. You can make additional changes to the query. By default, Connect validates the query. But if you have a complex query that you've already tested and you want to skip the time Connect spends to validate it, select No for Validate Query. Then select Next.
  11. From Refresh Method, select how you want the table to refresh.
    Replace
    Replace existing records with imported records.
    Append
    Add all imported data to the existing data.
    Append New
    Add only new imported records to existing data.
    Update
    Add new imported records and update existing data.
  12. In Refresh Key, enter text to filter the list. Then select one or more fields to use to identify unique records.
  13. Select Save . If you go to the Prep Tool and select Run , Minitab Connect displays the imported data.
    Note

    If no data appears after you select Run, select Reset Config .

When you save the import, Connect creates a flow with the settings that you selected. For more information on how to schedule a flow and add more data processes to clean data, go to Overview of the Flow Tool .

Export a file

Complete the following steps to export data from a Minitab Connect table.
  1. Open the Outputs tab of the Flow Tool .
  2. Select the plus sign beside Export to add a new export.
  3. Under Export, select New Export. If you select One Time Download, Connect downloads the file to your computer and does not create an export.
  4. Enter a Name for the export.
  5. Under View, select a saved view of the table. If you select None, Connect exports all the data from the table. For more information on saved views, go to Example of creating a data view.
  6. Under Delivery, select Connection.
  7. Under Connection, select the connection that you previously set up.
  8. Select the Database and enter a Table Name for the table in Hive. The File Type must be CSV

You can specify other options. When you're finished, select Save to add the export to the list of exports in the Outputs tab. Select Run to run the export.