Complete the following steps to import data from Hive and create a new table in Connect.
- From the Home screen, select the
Add button under Tables.
- In Datasets, enter Hive and
select the connector from the results.
- Under Connection, select the connection that
you previously set up. If you have not previously set up a connection,
select New Connection and follow the steps in the
previous section.
- On the left panel, enter a Name that identifies the table
and select a Folder to save the table.
- Under Update Frequency, specify how often you want
Connect to update the table. You can import the data one time or have
Connect continuously import the data at a set time interval. Connect
automatically creates a flow for your import. If you select
Once, you can use the flow at a later time to
automatically run the import again.
- Under Setup [1 of 6], select the Database that contains the data. Under Skip to SQL, select Yes to skip the import options and enter the SQL statement manually. Select
No to use the Minitab Connect options to create the SQL statement.
Then select Next.
- Select the Table that you want to import. Then select Next.
- In Fields, enter text to filter the list. Then select
the table fields that you want to import and select Next.
- In Limit, enter a value to limit the number of
records that you want to pull. Leave this field blank to pull all records.
Then select Next.
- In Query, Connect displays the query that runs on the
HiveServer. You can make additional changes to the query. By default,
Connect validates the query. But if you have a complex query that you've
already tested and you want to skip the time Connect spends to validate it,
select No for Validate Query.
Then select Next.
- From Refresh Method, select how you want the table to
refresh.
- Replace
- Replace existing records with imported records.
- Append
- Add all imported data to the existing data.
- Append New
- Add only new imported records to existing data.
- Update
- Add new imported records and update existing data.
- In Refresh Key, enter text to filter the list. Then
select one or more fields to use to identify unique records.
- Select Save
. If you go to the
Prep Tool and select Run
, Minitab Connect
displays the imported data. Note
If no data appears
after you select Run, select Reset Config
.
When you save the import, Connect creates a flow with the settings that you selected. For more
information on how to schedule a flow and add more data processes to clean data, go
to Overview of the Flow Tool
.