Pentaho Expert Quiz
1) What are the execution options available for running a transformation?
- Local Execution
- Execute remotely
- Execute clustered
- All of the above
2) ____________ is a simple web server that allows you to execute transformations and jobs remotely.
- Carte
- PAN
- Kettle
- None of the above
3) Which step is used to update a junk dimension?
- Dimension lookup/update
- Combination lookup/update
- Insert/update
4) Which of the statements below helps improve performance?
- Indexes need to be enabled before running a session
- Drop the constraints and indexes before running the session and rebuild them after the session completes
- Indexes should be dropped but constraints should be enabled before running a session
- Disable the constraints but enable the indexes before running a session
5) How can you minimize the cache for a joiner transformation?
- Make the smaller source the master source
- Make the smaller source the details source
- Sort the master data prior to the joiner transformation
- Sort the detail data prior to the joiner transformation
6) [(# of reads from disk) + (# of writes to disk)]/[(# of reads from memory cache) + (# of writes to memory cache)] is the formula for
- hit ratio
- miss ratio
- a&b
- None of the Above
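As a hedged illustration of the formula in question 6, the sketch below evaluates the ratio for made-up counter values. The function name `disk_to_memory_ratio` and the sample counts are assumptions for illustration only; they do not come from any Pentaho API.

```python
def disk_to_memory_ratio(disk_reads, disk_writes, cache_reads, cache_writes):
    """Return (disk reads + disk writes) / (cache reads + cache writes)."""
    cache_ops = cache_reads + cache_writes
    if cache_ops == 0:
        raise ValueError("no memory-cache operations recorded")
    return (disk_reads + disk_writes) / cache_ops


# Hypothetical counters: 30 disk operations against 300 cache operations.
ratio = disk_to_memory_ratio(disk_reads=20, disk_writes=10,
                             cache_reads=250, cache_writes=50)
print(ratio)
```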
7) Which is not valid for optimizing system bottleneck?
- Improve network speed
- Improve CPU performance
- Increase paging
- Check hard disks on related machines
8) What are the Xms and Xmx parameters when starting JVMs?
- The flag Xmx specifies the maximum memory allocation pool for a Java Virtual Machine (JVM), while Xms specifies the initial memory allocation pool.
- The flag Xmx specifies the minimum memory allocation pool for a Java Virtual Machine (JVM), while Xms specifies the initial memory allocation pool.
- The flag Xmx specifies the maximum memory allocation pool for a Java Virtual Machine (JVM), while Xms specifies the minimum memory allocation pool.
- None of them
9) When partitioning, you would choose the key with the fewest repeating values as the partition key.
- TRUE
- FALSE
10) PDI encodes a boolean as ‘Y’ or ‘N’, which cannot be inserted into a BIT(1) column in MySQL. Which step can be used to insert booleans into a MySQL database?
- Select values
- Javascript
- Calculator
- Value Mapper
11) A transformation computes 28/222 in a Calculator step. The two input fields are of type Integer, but the result type is Number(6, 4). What is the expected output?
- 0
- 1261
- 12612612613
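The arithmetic behind question 11 can be checked independently of PDI. The sketch below only contrasts integer division, floating-point division, and rounding to four decimal places in Python; it makes no claim about what the Calculator step itself returns, and reading Number(6, 4) as "four decimal places" is an assumption.

```python
from decimal import Decimal, ROUND_HALF_UP

numerator, denominator = 28, 222

# Integer division discards the fractional part.
integer_quotient = numerator // denominator
# True division keeps the fractional part as a float.
float_quotient = numerator / denominator
# Round to 4 decimal places (assumed interpretation of a precision of 4).
four_places = (Decimal(numerator) / Decimal(denominator)).quantize(
    Decimal("0.0001"), rounding=ROUND_HALF_UP)

print(integer_quotient, float_quotient, four_places)
```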
12) The HTTP Client step needs to be triggered explicitly in PDI.
- TRUE
- FALSE
13) Does the HTTP client support SOAP?
- YES
- NO
14) In PDI, most of the steps use optimization techniques which map column names into field numbers
- TRUE
- FALSE
15) Which step can be used to join heterogeneous sources?
- Multiway merge join
- Join rows
- Database Join
- None of the above
16) How many evaluation modes are there in a job hop?
- 2
- 3
- 4
- None of the above
17) What are the different kinds of job hops available in PDI? Select all that apply.
- Unconditional
- Follow when result is true
- Follow when result is false
- All of them
18) If “Do not pass the row if the lookup fails” is enabled in the Database Lookup step, what is the resulting join?
- Inner join
- Outer join
- Left outer join
- None of the above
19) What are the different access methods available to connect to a database in PDI? Select all that apply.
- Native(JDBC)
- JNDI
- OCI
- All of them
20) Which of the statements below are true for Enable Caching in the Database Lookup step? Select all that apply.
- Increased Cache size increases performance
- Reduces the DB Calls and increases the performance
- This means that the database is expected to return the same value all the time for a certain lookup value.
- All of them
21) Which one is not correct about filter transformation?
- Acts as a ‘where’ condition
- Can pass multiple conditions
- Acts like ‘Case’ in PL/SQL
- If a record does not match the condition, processing moves on to the next record
22) What parallelisms does Pentaho support?
- Data parallelism: data is processed in parallel
- Component parallelism: components are executed in parallel
- Pipeline parallelism: data is passed from one component to the next
- All of the above
23) Pentaho supports
- Only homogeneous source and target
- Heterogeneous source and target
- Only heterogeneous Target
- Both A and B
24) Which is the better approach to utilize logic from one transformation or job in another process?
- Transformations and jobs can be nested
- Transformation logic can actually be shared by creating subtransformations
- Both A and B
- None of the above
25) What is Uniform Data Access Integration?
- UDAI places the data in the source systems.
- A set of views is defined to give clients/customers access to the unified view.
- Data can be propagated from the source system with zero latency.
- All of the above
26) What is the use of the Select values step?
- Select and Alter: specify the exact order and names of the fields in the output rows
- Remove: specify the fields that have to be removed from the output rows
- Meta-data: change the name, type, length and precision (the metadata) of one or more fields
- All of the above.
27) What is the default insert method used by the Table Output step in Pentaho?
- Batch Insert
- Bulk Insert.
- Single Row Insert.
- None of the above.
28) Which types of joins can be specified in the Merge Join step?
- Only INNER
- Only LEFT OUTER and RIGHT OUTER
- INNER, LEFT OUTER, RIGHT OUTER, FULL OUTER
- Only FULL OUTER
29) Which of the following insert techniques does the Oracle Bulk Loader step use?
- Batch Insert
- SQL Loader
- Interval-based inserts.
- None of the above
30) How is a Pentaho variable defined in a Unix environment?
- %%VariableName%%
- ${VariableName}
- %%VariableName$$
- None of the above.
31) The Execute SQL Script step can be used:
- during the initialization phase of the transformation
- for every input-row that the step is given
- a & b
- none of the above.
32) The Merge Join step requires its inputs to be:
- Pre-Sorted prior to Join Step.
- No need to be sorted, the step itself performs the Sort.
- Needs to be sorted using Sort Step.
- A & C.
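For readers unfamiliar with the general merge-join technique that question 32 touches on, here is a minimal sketch in Python. It is a textbook-style illustration, not PDI's implementation; the function name `merge_join_inner` and the sample rows are hypothetical. The algorithm walks both inputs once in key order, which is why it assumes both lists arrive sorted by key.

```python
def merge_join_inner(left, right):
    """Inner-join two lists of (key, value) pairs that are already sorted by key."""
    result = []
    i = j = 0
    while i < len(left) and j < len(right):
        lkey, rkey = left[i][0], right[j][0]
        if lkey < rkey:
            i += 1
        elif lkey > rkey:
            j += 1
        else:
            # Find the run of equal keys on the right, then pair it with
            # every left row that carries the same key.
            j_end = j
            while j_end < len(right) and right[j_end][0] == lkey:
                j_end += 1
            while i < len(left) and left[i][0] == lkey:
                for _, rval in right[j:j_end]:
                    result.append((lkey, left[i][1], rval))
                i += 1
            j = j_end
    return result


# Hypothetical sample data, sorted by key before joining.
left = sorted([(3, "c"), (1, "a"), (2, "b")])
right = sorted([(2, "x"), (3, "y"), (3, "z"), (4, "w")])
joined = merge_join_inner(left, right)
print(joined)
```

Feeding the same function unsorted lists would silently skip matches, because the two cursors only ever move forward.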
33) The Sort rows step can be used to remove duplicate data.
- TRUE
- FALSE
34) How do you determine the slowest step in a transformation?
- A slow step will have consistently large input buffer sizes
- A slow step will consistently have low input buffer sizes
35) How do you determine the fastest step in a transformation?
- A fast step will have consistently large input buffer sizes
- A fast step will consistently have low input buffer sizes
36) Which of the following statements are true? Select all that apply.
- Arguments are command line arguments that you would normally specify during batch processing (via Pan)
- Arguments are environment or PDI variables that you would normally set in a previous transformation in a job or in the Operating System.
- Variables are environment or PDI variables that you would normally set in a previous transformation in a job or in the Operating System.
- A & B
37) The Simple-JNDI layer does not create true JNDI connections, so the behavior of connections in Spoon may differ slightly compared to an Application Server running the same PDI job or transformation.
- TRUE
- FALSE
38) What is the syntax to pass an argument to the Table Input step from the previous step?
- SELECT VID FROM CO_VEHICLES WHERE PONO = ${input_param};
- SELECT VID FROM CO_VEHICLES WHERE PONO = ?;
39) What is the syntax to pass a variable to the Table Input step?
- SELECT VID FROM CO_VEHICLES WHERE PONO = ${input_param};
- SELECT VID FROM CO_VEHICLES WHERE PONO = ?;
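The two query styles in questions 38 and 39 correspond to two general mechanisms: textual substitution of a named token, and binding a value to a positional placeholder. The sketch below shows the difference using Python's `string.Template` and an in-memory SQLite database as stand-ins. The table contents are invented, and nothing here describes how PDI itself resolves either form.

```python
import sqlite3
from string import Template

# In-memory stand-in database with hypothetical sample rows.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE CO_VEHICLES (VID INTEGER, PONO INTEGER)")
conn.executemany("INSERT INTO CO_VEHICLES VALUES (?, ?)", [(1, 100), (2, 200)])

# Style 1: textual substitution. The ${input_param} token is replaced in the
# SQL string before the statement is sent to the database.
sql_text = Template(
    "SELECT VID FROM CO_VEHICLES WHERE PONO = ${input_param}"
).substitute(input_param="100")
rows_substituted = conn.execute(sql_text).fetchall()

# Style 2: positional placeholder. The SQL text stays fixed and the value
# is bound separately when the statement runs.
rows_bound = conn.execute(
    "SELECT VID FROM CO_VEHICLES WHERE PONO = ?", (200,)
).fetchall()

print(sql_text)
print(rows_substituted, rows_bound)
```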
40) In which folder are the kettle.properties and shared.xml files stored?
- .pentaho
- .kettle
- data-integration
- config
41) How can you set the location of the folder containing kettle.properties and shared.xml?
- Environment variable – KETTLE_HOME
- Environment variable – PENTAHO_HOME
- It can’t be changed
- It is always located in the directory where the script is run from
42) What is stored in kettle.properties?
- Default properties file for variables
- Default shared objects file
- The database cache for metadata
- The local repositories file
43) How do you access parameters in a transformation so that they can be used with the data records?
- Use Get Variables to create variables populated with the Parameter’s value
- Parameters can be directly referenced by their name for each data record
- They aren’t used in the data record
- Parameters can only be used by the first step in the transformation
44) Are the JavaScript steps the same for Job and Transformation?
- Yes
- No
45) If there is a name collision between a parameter and a variable, which takes precedence?
- Parameter
- Variable
- Job will throw an error
- None of the above
46) What is the execution order of the scripts in the Modified Java Script Value step?
1. Set Start Script
2. Set End Script
3. Set Transform Script
- 1,2,3
- 3,1,2
- 1,3,2
- 2,1,3
47) The Text file input step has a compression option.
- TRUE
- FALSE
48) Which of the options below is not available in the Generate Random Value step?
- Random Integer
- Random Number
- Random String
- Random Character
49) Which output type is not available in the Automatic Documentation Output step?
- DOC
- XML
- XLS
50) Which options are valid when writing rows in the Microsoft Excel Writer step?
1. overwrite existing cells
2. shift existing cells down
3. shift existing cells up
- 1,2
- 2,3
- 1,3
- All of the above
51) The Unique rows step requires sorted input.
- TRUE
- FALSE
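One general way to remove duplicates is to compare each row only with the row immediately before it. The sketch below illustrates that technique in Python with `itertools.groupby`; the helper name `drop_adjacent_duplicates` and the sample rows are hypothetical, and whether any particular PDI step works this way is not asserted here.

```python
from itertools import groupby


def drop_adjacent_duplicates(rows):
    """Keep the first row of each run of equal consecutive rows."""
    return [key for key, _ in groupby(rows)]


unsorted_rows = ["b", "a", "b", "a", "a"]
as_is = drop_adjacent_duplicates(unsorted_rows)
after_sort = drop_adjacent_duplicates(sorted(unsorted_rows))
print(as_is)       # repeats that are not adjacent survive
print(after_sort)  # sorting first makes every repeat adjacent
```

With this approach, the order of the input decides whether all duplicates are caught.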
52) What will be the output of the following expression in the Formula step: ISNA(“#N/A”)?
- TRUE
- FALSE
53) What will be the output of the following expression in the Formula step: ISTEXT(“1”)?
- TRUE
- FALSE
54) The Memory Group by step requires sorted input.
- TRUE
- FALSE
55) By default Null and “” are the same in PDI
- TRUE
- FALSE
56) In PDI, every row sent over a single hop needs to have the same structure.
- TRUE
- FALSE
57) Which step can be used to get the system date and arguments?
- Get system info
- Get variables
- Get files from result
- Get repository names
58) Are loops allowed in a transformation?
- Yes
- No
59) Are loops allowed in a job?
- Yes
- No