Previous Topic: Configure Number of Threads for a Bulk Load SubmissionNext Topic: Use Case for PeopleSoft


How to Use Kettle Pentaho with the Bulk Load Client

You can use the Bulk Load Client to also run a Kettle job before the bulk load process using the Bulk Load Client command line options. <Insert X-ref to “Bulk Load Client Command Line Options” topic>

The sample uses the following options:

imbulkloadclient.bat -f Kettle -t C:/MyKettleJob.kjb -o C:/MyOutput.csv -x 60 

The following sections explain each option:

Select the Kettle Job option

-f Kettle 

This option specifies that a Kettle job is going to be run. The base option, -f / --format <value>, determines the format of the input file / transform.

Specify the Kettle Job file

-t C:/MyKettleJob.kjb

This option specifies C:/MyKettleJob.kjb as the file (and its location) used as the template for the input file transformation.

Note: This command string does not use the existing -i / --inputFile option for a Kettle Job. Loading the input file should be part of your Kettle Job/Transform

Specify the output File to Bulk Load

There are two ways to specify the output for the Kettle Job that will be bulk loaded.

When outputting the Kettle to a single file, you would use an option similar to the following:

-o C:/MyOutput.csv

The -o / --outputFile <value> option determines the file of transformation process.

When outputting the Kettle to several files, you would use options similar to the following:

-d C:/MyOutputDir -O MyOutput1.csv,MyOutput2.csv

This uses the following command line options:

Specify a Timeout for the Kettle Job

-x 60

This is the Timeout for Kettle transformation, in seconds.

This command is based on the following format:

-x / --transformTimeout <value> 

Note: This command is optional.

Example of Executing a Bulk Load Client with Kettle

For a single output file:

imbulkloadclient.bat -f Kettle -t C:/MyKettleJob.kjb -o C:/MyOutput.csv -x 60 

For multiple output files:

imbulkloadclient.bat -f Kettle -t C:/MyKettleJob.kjb -d C:/MyOutputDir -O MyOutput1.csv,MyOutput2.csv -x 60