You have a parallel job that, based on operational recoverability requirements, needs to be broken up into two separate parallel jobs. You have decided to use the Data Set stage to support this job design change. What two characteristics of Data Sets make them a good design consideration in your job design change? (Choose two.)
A. They sort the data in a staging area.
B. They automatically convert data types.
C. They persist the parallelism of the job creating them.
D. They use the same data types as the parallel framework.
E. They persist parallelism into a temporary repository table.
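Note: the idea of handing partitioned data from one job to the next can be sketched in Python. This is a conceptual analogy only, not the Data Set file format: the functions `write_partitioned` and `read_partitioned`, the JSON files, and the modulus partitioning on an integer key are all assumptions made for illustration.

```python
import json
import os
import tempfile


def write_partitioned(records, key, partitions, directory):
    """Split records by key modulus; write one file per partition plus a descriptor."""
    buckets = [[] for _ in range(partitions)]
    for record in records:
        buckets[record[key] % partitions].append(record)

    files = []
    for index, bucket in enumerate(buckets):
        path = os.path.join(directory, f"part_{index}.json")
        with open(path, "w", encoding="utf-8") as handle:
            json.dump(bucket, handle)
        files.append(path)

    # The descriptor records which files make up the partitioned data.
    descriptor = os.path.join(directory, "descriptor.json")
    with open(descriptor, "w", encoding="utf-8") as handle:
        json.dump({"key": key, "files": files}, handle)
    return descriptor


def read_partitioned(descriptor):
    """Return the partitions exactly as written, without repartitioning."""
    with open(descriptor, encoding="utf-8") as handle:
        meta = json.load(handle)
    partitions = []
    for path in meta["files"]:
        with open(path, encoding="utf-8") as handle:
            partitions.append(json.load(handle))
    return partitions


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        desc = write_partitioned([{"id": i} for i in range(6)], "id", 2, tmp)
        print(read_partitioned(desc))
```

In this sketch the second function stands in for the downstream job: it receives the same number of partitions, with the same rows in each, that the upstream step produced.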
Which statement is true when using the SaveInputRecord() function in a Transformer stage?
A. You can only use the SaveInputRecord() function in Loop variable derivations.
B. You can access the saved queue records using Vector referencing in Stage variable derivations.
C. You must retrieve all saved queue records using the GetSavedInputRecord() function within Loop variable derivations.
D. You must retrieve all saved queue records using the GetSavedInputRecord() function within Stage variable derivations.
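Note: the save-then-retrieve pattern behind SaveInputRecord() and GetSavedInputRecord() can be modeled in Python. This is a conceptual model, not Transformer code: the class `SavedRowCache`, its methods, and the function `process_group` are hypothetical names, and first-in, first-out retrieval is an assumption of the sketch.

```python
from collections import deque


class SavedRowCache:
    """Hypothetical stand-in for a queue of saved input rows."""

    def __init__(self):
        self._rows = deque()

    def save_input_record(self, row):
        # Append the current row and report how many rows are now saved.
        self._rows.append(row)
        return len(self._rows)

    def get_saved_input_record(self):
        # Hand back the oldest saved row (FIFO order is an assumption).
        return self._rows.popleft()

    def pending(self):
        return len(self._rows)


def process_group(rows):
    """Save every row of a group, then drain the queue one row per loop pass."""
    cache = SavedRowCache()
    saved = 0
    for row in rows:
        saved = cache.save_input_record(row)

    total = sum(row["amount"] for row in rows)
    output = []
    for _ in range(saved):  # one loop iteration per saved row
        row = cache.get_saved_input_record()
        output.append({**row, "group_total": total})

    # Every saved row has been retrieved, so nothing is left behind.
    assert cache.pending() == 0
    return output
```

The point of the model is the pairing: each call that saves a row is matched by exactly one retrieval, so the queue is empty when the group has been processed.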
Which requirement must be met to read from a database in parallel using the ODBC connector?
A. ODBC connector always reads in parallel.
B. Set the Enable partitioning property to Yes.
C. Configure environment variable $APT_PARTITION_COUNT.
D. Configure environment variable $APT_MAX_TRANSPORT_BLOCK_SIZE.
Configuring the weighting column of an Aggregator stage affects which two options? (Choose two.)
A. Sum
B. Maximum Value
C. Average of Weights
D. Coefficient of Variation
E. Uncorrected Sum of Squares
The ODBC stage can handle which two SQL Server data types? (Choose two.)
A. Date
B. Time
C. GUID
D. Datetime
E. SmallDateTime
In a Transformer expression for a stage variable, there is a nullable input column. Assume the legacy NULL processing option is turned off. What happens when a row is processed that contains NULL in that input column?
A. The job aborts.
B. The row is rejected.
C. NULL is written to the stage variable.
D. The value written to the stage variable is undetermined.
Which derivations are executed last in the Transformer stage?
A. Input column derivations
B. Loop variable derivations
C. Output column derivations
D. Stage variable derivations
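Note: the per-row evaluation order of a Transformer with a loop can be sketched in Python. This is a simplified model that assumes the order stage variables, then loop variables, then output columns, and it leaves out loop conditions and link constraints. The function `run_transformer` and all of its parameters are hypothetical.

```python
def run_transformer(rows, stage_vars, loop_vars, output_cols, loop_count):
    """Evaluate a simplified Transformer pass per input row and record the order."""
    trace = []
    results = []
    for row in rows:
        env = dict(row)
        # 1. Stage variable derivations run once per input row.
        for name, fn in stage_vars.items():
            env[name] = fn(env)
            trace.append(("stage", name))
        # 2. The loop body runs loop_count(env) times for this row.
        for iteration in range(1, loop_count(env) + 1):
            env["iteration"] = iteration
            # 3. Loop variable derivations run once per iteration.
            for name, fn in loop_vars.items():
                env[name] = fn(env)
                trace.append(("loop", name))
            # 4. Output column derivations run after the loop variables.
            out = {}
            for name, fn in output_cols.items():
                out[name] = fn(env)
                trace.append(("output", name))
            results.append(out)
    return results, trace
```

Because each step writes into the same `env` dictionary, a later derivation can read what an earlier one produced, which is why the order matters.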
Your job is using the Sequential File stage as a source. Which two stage property configurations will allow you to accomplish parallel reads? (Choose two.)
A. Read Method property set to 'File Pattern' and the environment variable $APT_IMPORT_PATTERN_USES_FILESET is set to 'FALSE'.
B. Read Method property set to 'File Pattern' and the 'Readers per Node' property value set to greater than 1.
C. Read Method property set to 'Parallel' with the Key property set to an input column that has the file names.
D. Read Method property set to 'Specific File(s)' with only one file specified and the 'Readers per Node' property value set to greater than 1.
E. Read Method property set to 'Specific File(s)' with more than one file specified and each file specified has the same format/column properties.
What two project environment variables can be considered in your parallel jobs to support your optimization strategy of partitioning and sorting? (Choose two.)
A. $APT_NO_PART_INSERTION
B. $APT_OPT_SORT_INSERTION
C. $APT_RESTRICT_SORT_USAGE
D. $APT_PARTITION_FLUSH_COUNT
E. $APT_TSORT_STRESS_BLOCKSIZE
You are experiencing performance issues for a given job. You are assigned the task of understanding what is happening at run time for that job. What step should you take to understand the job performance issues?
A. Replace Join stages with Lookup stages.
B. Run the job with $APT_TRACE_RUN set to true.
C. Run the job with $APT_DUMP_SCORE set to true.
D. Replace Transformer stages with custom operators.