Data Stage Intermediate Quiz
Data Stage Intermediate Quiz contains set of 103 Data Stage MCQ Questions With Answers which will help you to clear Intermediate level quiz.
1) Deletion code generated in change capture when the
- Record which present only in before dataset not in After dataset
- Records which present only in after datset not in before dataset
- Record present in both the datasets
- None
2) Change capture stage contains two input links as
- previous and current
- history and delta
- before and after
- None
3) What are the characteristics of the containers
- Containers are used to execute jobs.
- Containers are a group of stages and links.
- Containers are used to stop jobs
- Containers allow multiple users to access a single job.
4) Which one is the type of view in Datastage Director?
- Job View
- Log View
- Status View
- All of the above
5) What does the ERROR 81021 Calling subroutine DSR_RECORD ACTION=2 mean and the solution is?
- It means when a job sequence is used and it contains many stages (usually more than 10) and very often when a network connection is slow and the solution is to redesign the whole job
- Error means that the source records are bad and the job needs a restart
- It is due to the communicaiton failure between DataStage client and server. First try to complie by using explicit login to Datastage Designer if problem still persist then execute DS.REINDEX ALL command from the Datastage shell.
- Job has failed with the error 81021 due to large volumes of data processing and the solution is to split the source records into multiple threads
6) Encode stage, contains maximum how many links
- 3
- 10
- 1
- 0
7) Environment variable which needs to be set to debug the run time environment
- $OSH_PRINT_DROPPED_COLUMNS
- $OSH_PRINT_SCHEMAS
- $OSH_SHOW_COMPONENT_CALLS
- $OSH_SHOW_STARTUP_STATUS
8) Environment variable that needs to be set for compiling the datastage job
- APT_LINKER
- APT_PM_DBX
- APT_LINKEROPT
- APT_COMPILER
9) Which environment variable,when set to true,causes a report to be produced which shows the operators,processes and data sets in the job?
- APT_DUMP_SCORE
- APT_JOB_REPORT
- APT_MONITOR_SIZE
- APT_RECORD_COUNTS
10) What does setting an environment variable, specified as a job parameter, to PROJDEF do?
- Populates the environment variable with the value of PROJDEF.
- Explicitly unsets the environment variable.
- Uses the value for the environment variable as shown in the DataStage Administrator.
- Uses the current setting for the environment variable from the operating system.
11) what is meaning of file extender in data stage server jobs?
- File extender means adding a new file to an already existing file in datastage
- File extender means adding only records to an already existing file in datastage
- File extender means adding columns or records to an already existing file in datastage
- File extender means adding only columns to an already existing file in datastage
12) Select the correct statement
- Remove duplicate stage having reject link
- Filter stage having reject link
- Both A and B
- None of the above
13) What Stage is used to read a flat file with pipe delimited.
- File lookup Stage
- sequential Stage
- Complex Flat File Stage
- Transformer Stage
14) Debug menu is available for
- Server Jobs
- Parallel Jobs
- Shared Containers
- All of the above
15) Which statement is not true from following
- Annotations can be used in server and parallel job but not in mainframe jobs
- Annotations can be used in server and parallel job but not in shared containers
- Annotations can be used only in server job
- None of the above
16) Variable used to set the directory to write the build op code
- OSH_BUILDOP_OBJECT
- OSH_BUILDOP_XLC_BIN
- OSH_BUILDOP_CODE
- A&C
17) Degree of Parallelism is DataStage job is defined by
- Logical nodes configured in the config file
- No of nodes in the hardware
- Job Run time Parameters
- Target Database
18) Is direct lookup possible using DB2 enterprise stage
- Yes
- No
19) A left outer join can be implemented by
- Join Stage
- Lookup Stage
- Join or Oracle stage
- None of the above
20) what does deletion of a dataset from “DataStage designer -> Tools -> DataSet Management” do?
- Deletes the control file only
- Deletes the dataset file at the server only
- Deletes both the control and server files
- None of the above
21) Datastage jobs can be exported in .XML file as well as in .DSX file
- True
- FALSE
22) Datastage Native operators are developed in
- C ++
- C
- BASIC
- COBAL
23) Loading Data into Teradata tables is achieved by
- ODBC Stage
- Teradata API
- JAVA API
- All of the above
24) Job Version Control can be achieved by
- Versioning Tools
- Versioning feature available within the Tool
- Both
- None of the Above
25) Which two tasks will create DataStage projects
- Install the DataStage engine
- Copy a project in DataStage Administrator
- Add new projects from DataStage Administrator
- Both A and C
26) Which three lookup types may be performed in the Lookup stage?
- Equality match
- Range on stream link
- Range on the reference link
- All of the above
27) Which Oracle Enterprise stage read property can be set using db options to tune job performance?
- memsize
- arraysize
- partitionsize
- transactsize
28) Which two stages allow reject links?
- Join stage
- Merge stage
- Lookup stage
- Both B & C
29) You would like to compare two versions of a job that has been copied and renamed. How would
you proceed?
- Use Advanced Find to locate job duplicates.
- Use the Compare against feature
- Verify the properties of the job for the version.
- Perform a check sum on an export of the job.
30) Which two statements describe both File Sets and Data Sets?
- File and Data Sets preserve partitioning.
- File and Data Sets are stored in internal format
- File and Data Sets contain header file and data files
- Both A & C
31) Which stage does not require sorted input?
- Join stage
- Merge stage
- Lookup stage
- Remove Duplicates stage
32) Which partitioning method requires specifying a key
- DB2
- Entire
- Modulus
- Random
33) What is descriptor file in dataset stage?
- It contains the description of data file
- It contains the data file
- It contains the address of data file
- All of the above
34) What is data file in dataset stage ?
- Data File contains the data in native format
- Data File contains the configuration data
- It contains the address of dataset
- None of the above
35) What is OConv () function and where is it used?
- OConv is used to convert only date into internal format
- OConv is used to convert only date into user understandable format
- OConv is used to convert the system understandable format to user understandable format
- None of the above
36) What is IConv () function and where is it used?
- IConv is used to convert only date into internal format
- IConv is used to convert the input to system understandable format
- IConv is used to convert only date into user understandable format
- All of the above
37) What is NLS in datastage? how we use NLS in Datastage ?
- NLS stands for no language support. It is used for excluding other country languages like French, German, Spanish, etc.
- NLS stands for national language support. It is used for including other country languages like French, German, Spanish, etc(whose scripts are similar to English) in the data that is processed
- NLS stands for neutral library support. It is used to support the system library functions used for other country languages like French, German, Spanish, etc.
- None of the above
38) What is the link partitioner in datastage job?
- It’s used for partition parallelism
- It’s used for pipeline parallelism
- The Link Partitioner stage is an active stage which takes one input andallows you to distribute partitioned rows to up to 64 output links. It’s used in DataStage Server Jobs.
- All of the above
39) What is Modulus in Dynamic Hashed File?
- If the size of the file remains same it is called as “Modulus”
- If the size of the file increases it is called as “Modulus”
- If the size of the file decreases it is called as “Modulus”.
- None of the above
40) What is Splitting in Dynamic Hashed File?
- If the size of the file remains same it is called as “Splitting”.
- If the size of the file increases it is called as “Splitting”.
- If the size of the file decreases it is called as “Splitting”.
- None of the above
41) Link Partitioner enables to run server jobs in parallel but needs MPP system
- TRUE
- False
42) Enable hashed file cache sharing can be used to
- Run multiple instances of a job
- Enable multiple processes to access the same hash file in cache
- Save memory resources and speed up execution
- Both B & C
43) Comments in routine should begin with
- REM
- * (Start Sign)
- ! (Exclamation Sign)
- All of the above
44) Which one is correct from below:
- IF X THEN
- A = B; REM The rest of this line is a comment; B = C
- END
- IF X THEN
- A = B; REM The rest of this line is a comment
- B = C
- END
- IF X THEN
- A = B; * The rest of this line is a comment; B = C
- END
- All of the above
45) Server Job compiles successfully but doesn’t run because
- Job has not been compiled with compatible compiler
- Job design have cyclic dependencies within a sequence of active stages
- Configuration file is missing from Datastage server
- All of the above
46) Which of the below statement is not true when a Datastage job is validated, :
- Connections are made to the data sources or data warehouse for server jobs
- Parallel job runs in ‘check only’ mode so data is not affected
- Intermediate files are create
- A
- B
- C
- D
47) Parameter Sets provide an easier and faster method to add parameters to a job, eliminating the need to add parameters invidiually to each job.
- True
- FALSE
48) Which of the following statement is incorrect:
- We can take an export of Data Stage job executable in DSX format.
- We can take an export of Data Stage job executable in XML format.
- We can take an export of Data Stage job executable in pdf format.
- All of the above
49) What is Modulus and Splitting in Dynamic Hashed File
- Modulus – Increasing the file Size
- Splitting – Decreasing the file size
- Modulus – Decreasing the file Size
- Splitting – Increase the file size
- None
50) Size of read cache of a hash file can be set to the value between
- 0-256 MB
- 0-1024 MB
- 0-999 MB
- 0 – 2048 MB
51) What are the two types of hashed files?
- Static and Dynamic
- Generic and Specific
- Static and Specific
- Generic and Dynamic
52) One of the below join option is not available in Join stage
- Self join
- Inner join
- Left outer join
- Full outer join
53) Which one is false statement about the Join stage?
- All the inputs to the Join stage must be sorted by the Join key.
- Join stages can have reject links that capture rows without matches
- The Join stage supports inner, left outer, and right outer joins.
- None of the above
54) How does a Join stage process an Inner join?
- It transfers all values from the right data set and transfers values from the left data set and intermediate data sets only where key columns match.
- It transfers records from the input data sets whose key columns contain equal values to the output data set.
- It transfers all values from the left data set but transfers values from the right data set and intermediate data sets only when key columns match.
- It transfers records in which the contents of the key columns are equal from the left and right input data sets to the output data set. It also transfers records whose key columns contain unequal values from both input data sets to the output data set.
55) Number of input links that a merge stage can accepts
- 1
- 3
- 2
- n
56) Which feature does the Merge stage have that is not available in the Join and Lookup stages?
- All inputs must be sorted.
- Input data may be unsorted.
- Several reject links may be specified
- No more than one reject link may be specified.
57) A DataStage job contains a parallel Transformer with a single input link and a single output link. The Transformer has a constraint that should produce 1000 records,however only 900 came out through the output link. What should be done to identify the mis
- Scan generated osh script for possible errors.
- Remove the constraint on the output link.
- Turn trace on using DataStage Administrator.
- Add a Reject link to the Transformer stage.
58) Which one is false statement about the Parallel Transformer stage?
- The Transformer allows you to copy columns.
- The Transformer allows you to do lookups.
- The Transformer allows you to apply transforms using routines.
- The Transformer allows you to do data type conversions.
59) How does Hash Partition in a join stage affects the performance?
- Better performance
- Lower performance
- No Change
- Cant Say
60) The default partitioning method of a derived operator is
- Any
- Hash
- Round Robin
- Random
61) Which one is not a keyless partitioning method?
- Entire
- Modulus
- Round Robin
- Random
62) Which two partitioning methods are keyless?
- Round Robin
- Entire
- Hash
- Both A & B
63) Which one of the below is a passive stage?
- Peek stage
- Aggregator stage
- Sort stage
- Transformer stage
64) By default where can you output data using Peek stage?
- job log in director
- link properties of peek
- stage properties of peek
- none
65) Pivot stage in DataStage before 8.5 supports
- Only horizontal pivoting i.e. columns into row
- Only vertical pivoting i.e. rows into columns
- Both A & B
- None of the above
66) Players process are A.The actual processes associated with Stages B.Send stderr to SL C.Establish connections to other players for data flo
- A,B and C
- All
- D only
- A and B
67) Protected Project can be accessible only by user who has
- Super Operator role
- Production Manager role
- Super Operator & Production Manager Roles
- None
68) What Happens if RCP (Runtime column propagation) is disabled?
- Datastage jobs cannot be run
- OSH has to perform Import and export every time when the job runs and thus processing time of job is increased.
- The metadata of those stage whose output connects to the shared container input, will not be stored
- No impact on Datastage jobs and the metadata will be propagated at run time
69) When RCP is Disabled A.DataStage Designer will enforce Stage Input Column to Output Column mappings. B.At job compile time modify operators are inserted on output links in the generated osh C.Modify operators can add
- A only
- A and B
- None
- All
70) When RCP is Enabled A.DataStage Designer will not enforce mapping rules. B.No Modify operator inserted at compile time. C.Danger of runtime error if column nam
- A and B
- B and C
- All
- D only
71) Which should specified to manage Runtime Column Propagation?
- enabled in DataStage Administrator and at the stage level
- attached to a table definition in DataStage Manager
- enabled only at the stage level
- enabled with environmental parameters set at runtime
72) One of the below object don’t be a part of DataStage Repository
- Jobs
- Table Definitions
- Shared containers
- Local containers
73) Section Leader process are A.Forks Players processes (one/Stage) B.Establish connections to other players for data flow C.Manages up/down communication. D.Clean up upon completion.
- A and C are correct
- A and B are correct
- C and D are correct
- None
74) Sequencer job does not consist?
- Activity Stage
- Command Stage
- Routing Stage
- Non of them
75) Which one of the below would not be necessary to build a Job Sequence that: picks up data from a file that will arrive in an directory overnight, launches a job once the file has arrived, sends an email to the administrator upon successful completion of t
- Notification Activity
- Wait For File Activity
- Job Activity
- Sequencer
76) Difference between Hashfile and Sequential File in server job?
- Hash file stores the data based on hash algorithm and on a key value. A sequential file is just a file with no key column.
- Hash file can be used as a reference for look up. Sequential file cannot
- searching a record is faster in hash file as comparedf to sequential file.
- All of the above
77) Sequential File Stage A.Normally will execute in sequential mode B.Can execute in parallel if reading multiple files (file pattern option) C.Can use multiple readers within a node on
- A
- B
- A and B
- All
78) Which “Reject Mode” option in the Sequential File stage will write records to a reject link?
- Output
- Fail
- Drop
- Continue
79) How I can convert Server Jobs into Parallel Jobs?
- You can convert your server job into a server shared container. The server shared container can also be used in parallel jobs as shared container.
- Using IPC Stage, by keeping this stage in between two passive stages
- Using LINK PARTITIONER AND LINK COLLECTOR
- Not possible
80) How can improve the performance of the server jobs?
- By enable inter process row buffering through the administrator.
- By adding IPC stage between two passive stages
- A only
- Both A and B
81) Identify the usages of INPROCESS , INTERPROCESS row buffers from the following
- In Process – Connected between active stages. Improves performance of the jobs by turning of and on job recompilation.
- Inter Process – Used when running on SMP parallel system. Enables the job to run using a separate process for each active stage, which
- In Process – Connected between active stages. Improves performance of the jobs by turning in-process row bufferring during execution of the job
- Inter Process – Used when running on SMP parallel system. Enables the job to run using the same process for e
- In Process – Connected between active stages. Improves performance of the jobs by turning in-process row bufferring on and recompiling the job.
- Inter Process – Used when running on SMP parallel system. Enables the job to run using a separate process f
- In Process – Connected between active stages. Improves performance of the jobs by turning of and on job recompilation.
- Inter Process – Used when running on MPP system. Enables the job to run using a separate process for each active stage, which is run s
82) True or False? A.Server jobs are compiled and run on DataStage Server B.Parallel jobs are compiled and run on a DataStage Unix Server and can be run in parallel on SMP M
- Both are False
- A True and B False
- Both are True
- B True and A False.
83) In MPP/ Cluster systems A. Each node is a uniprocessor or SMP. B. Its having own hardware resources. C. Its sharing hardware resources across all nodes
- B and C
- D Only
- A,B and D
- B only
84) if stable sort is set to true it will allow the duplicate records to output links?
- True
- FALSE
85) Surrogate Key stages properties are A.Value Of the key is 16-bit, 32-bit, or 64-bit integer. B.starting number is 0 by default. C.starting number is 1 by de
- A,B and D are correct.
- D Only
- A and B Only
- A Only
86) Which Statements describes how to add functionality to the Transformer stage?
- Edit the C++ code generated by the Transformer Stage
- Create a new parallel routine in the routines category that specifies name,type,path and return type of an external program
- Create a new server routines in the routines category that specifics the name and category of the function written in datastage Basic
- Create a new parallel routine in the routines category that specifics the name,path,type and return type of a function written and compliled in C++
87) One of the below execution order is true in Transformer Stage
- Stage Variable > Function > Routine
- Stage Variable > Constraint > Derivation
- . Constraint > Stage Variable > Equation
- Derivation > Function > Stage Variable
88) Which one of the below reason would cause of using a Transformer stage instead of a Copy stage?
- Drop a column.
- Send the input data to multiple output streams.
- Select certain output rows based on a condition.
- All of the above
89) In CDC stage the ‘Delete’ option would recognise those
- records that are dropped in After Link
- records that are dropped in before Link
- records that are dropped in both the link
- A & B
90) Which containers can be saved independenly
- Shared Containers
- Local Containers
- Globe containers
- direct containers
91) During run time what is the default warnings after which a job would abort?
- 100
- 150
- 50
- 200
92) A job is built with multiple active stages interconnected. To get the warnings that’s generated correctly at the active stage which environmental variable has to be used?
- APT_CONFIG_FILE
- APT_DISABLE_COMBINATION
- APT_STARTUP_SCRIPT
- APT_THIN_SCORE
93) Which environment variable is used to override the pad character of 0x0 (ASCII null), used by default when InfoSphere® DataStage® extends, or pads, a string field to a fixed length
- APT_STRING_PADCHAR
- APT_DELIMITED_READ_SIZE
- APT_MAX_DELIMITED_READ_SIZE
- APT_IMPORT_PATTERN_USES_CAT
94) Exporting the job in ISX feature helps to
- Export jobs within a project
- Export jobs across projects within a server
- A & B
- Exports job even with executables if needed
95) In a Join stage the key columns have a different naming convention. So
- Use transformer stage before Join stage to rename the key columns to match both the links
- Use Copy stage before the Join stage to rename the Key columns to match
- Need not use any stages but directly connect the link to Join Stage
- A & B
96) The job has stage variables defined in the transformer. The stage variables are defined in such a way that they are dependent on each other
- The order of the stage variables defined impacts the output
- The order of the stage variables defined does not have any impact on the output
- Order of Execution within the stage variables are not mandate
- None
97) The Sort stage and Link sort stage does the functionality of the sorting. So
- Both of the types of stage do only sorting and no additional functionality
- Duplicates can be removed in link sort
- Duplicates can be removed in Sort stage
- Key change column is available in link sort
98) Complex flat files can
- Read flat files
- Read compressed VSAM files
- can have reject link
- ALL
99) In the lookup stage the warning “Ignoring duplicate entry at table record” indicates
- Duplicate in primary link
- duplicates in the reference link
- Duplicates at both the link
- A & B
100) Which one streamlines collaboration between business analysts, data modelers, and developers by capturing and defining business requirements in a common, familiar format and then transforming that business logic directly into DataStage ETL jobs
- Designer
- Glossary
- FastTrack
- Director
101) For Joining data between two different links with a logic of left outer and with the rejects
- use Lookup stage
- Use Join stage
- Use Merge stage
- A &C
102) For getting data from a reference link that’s huge in size
- use Lookup stage
- Use Join stage
- Use Merge stage
- B&C
103) To import the table definition use the following options
- ODBC Table definition
- Plug in Metadata definition
- Orchestrate Schema Definition
- All