Ab Initio Beginner Quiz
In this AbInitio online Quiz, we will cover topics related to Ab Initio, abinitio, ab initio software, ab initio software llc, initio, ab initio etl tool and so on.
1) What is the syntax for local variable declaration
- Let int x;
- let int x = 0
- Int x;
- Int x=0
2) Which component allow change in input and output port Structures
- Reformat
- Redefine
- Replicate
- Both a and b
3) Input table unloads table records into the AB Initio graph
- True
- FALSE
4) Lookup file represents one or more serial files or a multifile of data records small enough to be held in main memory
- True
- FALSE
5) Ceiling Returns the smallest integer value greater than or equal to a specified value
- True
- FALSE
6) Broadcast combines the records arbitrarily into a single flow.
- True
- FALSE
7) Read from and write to the same file in a graph is possible
- True
- FALSE
8) what is the functionality ddl_to_dml function
- Converts a database column in to near equivalent DML type
- Converts a column definition to the corresponding DML type definition
9) Which component will change the order of fields
- Reformat
- Redefine
- sort
- Both a & b
10) What does is_blank(“”) returns
- 1
- 0
- Both a & b
- None of the above
11) What does layout in graph means to the co>Operating system
- The location of files
- The number and locations of the partitions of multifiles
- The number of the partitions of program components and the locations where they execute
- All of the above
12) what would be the output of printf(“%f”, 12345678.90)
- 12345678.90
- “12345678.90”
- “12345678.900000”
- None of the above
13) How to change serial file to mfs
- copy file from serial file path to mfs file path
- Use Partition component
- All of the above
- None of the above
14) Combining multiple flow partitions of data records into a single flow is known as
- re-partitioning
- departitioning
- partitioning
- None of the above
15) When the data is divided into small chunks and processed on different components simultaneously we call it
- Component parallelism
- DataParallelism
- Pipeline parallelism
- None of the above
16) When multiple program components execute simultaneously on different branches of a graph it is called
- Pipeline parallelism
- Component parallelism
- DataParallelism
17) when several connected program components on the same branch of a graph execute simultaneously we call it
- Data Parallelism
- Pipeline parallelism
- Component parallelism
18) what is the output DML format of trash
- no output port/format
- same as input format
- blank
- All of the above
19) what port does Run SQL Component have?
- input port
- output port
- log port
- All of the above
20) Phasing
- separates work load
- sets sequence of operation
- Both a & b
- None of the above
21) Skew is
- load imbalance
- a partitioning component
- load balancer
- All of the above
22) Propagate from neighborhood, component, url, custom, hosts, database: These are
- ab initio connectors
- layout options
- input components
- All of the above
23) XFR is
- abinitio dml file
- redundant file
- user written transformation functions for reusability
- All of the above
24) Error port
- will have error records
- will have error message
- will have both
- doesn’t collect both
25) Reject Port
- will have reject error message
- will have rejected records
- will have both
- doesn’t collect both
26) deselect Port
- will have unselected error message
- will have unselected records
- will have both
- doesn’t collect both
27) How do you truncate a table
- Use Truncate Table component
- By using access tab of input/output component
- All of the above
- None of the above
28) What are the different types of joins
- right, left and full
- inner, full outer and explicit
- Product, full outer and implicit
- All of the above
29) What are the different types of parallelism in Ab Initio?
- Process parallelism, Data parallelism, Component parallelism
- Phase parallelism, Component parallelism, Pipeline parallelism
- Component parallelism, Data parallelism, Pipeline parallelism
- None of the above
30) What is component parallelism?
- A graph with multiple processes running simultaneously on separate data uses component parallelism.
- A graph that deals with data divided into segments and operates on each segment simultaneously uses component parallelism
- A graph with multiple phases running simultaneously on separate data uses component parallelism.
- A graph with multiple components running simultaneously on the same data
31) What is the notation to specify the URL of a layout.
- hostname/path/
- protocol://hostname/path
- path
- None of the above
32) Which of the following components need a sorted input?
- merge
- interleave
- gather
- All of the above
33) why is a gather component used
- To reduce data parallelism
- To reduce component parallelism
- both of the above
- None of the above
34) What is the maximum number of input ports a join components can have?
- only one
- 2
- 4
- 20
35) What does the max-core parameter refer to
- The maximum memory the component can use
- The maximum CPU utilization the component can use
- The maximum time limit the component can use
- The maximum number of rejects the component can allow
36) What does layout of a component mean?
- The place where the component works
- The depth of parallelism
- Number of partitions
- All of the above
37) Which component among the following is the most efficient
- Concatenate
- interleave
- Gather
- Merge
38) Which component saves the status information among the following components
- Phases
- Check points
- Phase breaks
- Commit
39) When complex joining expressions are required which component among these is preferred
- look up
- Join
- Merge
- Gather
40) Which component gives sorted output
- merge
- sort
- rollup
- All of the above
41) Which parameter specifies the components tolerence for reject events.
- reject port
- reject-threshold parameter
- max-core parameter
- log-reject parameter
42) What is the use of merge component?
- Joins Different Flows both sorted & unsorted
- Sorts Data
- Join Different sorted data Flows and maintains Sort order
- None Of The Above
43) Explain maxcore parameter?
- Minimum Amount Of Memory allocated To a Component
- Amount Of Main Memory Allocated To An Instruction
- Maximum Amount Of Main Memory Allocated To Store And Manipulate Data Temporarily
- Amount Of Main Memory Allocated To Store And Manipulate Data Permanently
44) What component will you use to undo the effect of Partition by Round Robin ?
- Partition by Expression
- Partition by key and Sort
- Interleave
- Partition by Range
45) Component that is used to reduce the volume of data flow due to narrow band width or lack of enough disk space to store data
- Compress
- Compress,Gzip
- Compress,Gzip,Gunzip
- None of the above
46) For which component connecting the ports is not mandatory
- Run SQL
- Intermediate file
- Output file
- Broadcast
47) Which is the generally used departitioning component?
- Concatenate
- Gather
- Merge
- All of the above
48) What do u call the file which can treat several serial files having the same record format as a single graph component
- Summary File
- Event Log File
- Adhoc Multifile
- Custom File
49) what is the control partition
- It is partion where the same data is run simultaneously in different components
- location of multifile’s datapartition
- A partition which divides the control information
- none of these
50) Replicate component supports which type of Parallelism?
- Data Parallelism
- Component Parallelism
- Pipeline Parellelism
- All of the above
51) If there are 100 records and only 10 percent rejection can be permitted the what is the value of limit and ramp?
- limit=100;ramp=10
- limit=0;ramp=0.1
- limit=0;ramp=10
- limit=0;ramp=0
52) Which component do not have finalize transform function?
- Dedup sorted
- Denormalize Sorted
- Rollup
- Scan
53) The main programs required for abinitio software?
- only GDE
- OS and GDE
- Co operating System and GDE
- None of these
54) Which partition balances the workload between CPU’s
- partition by round robin
- partition by percentage
- partition by load balance
- none of these
55) The component which does not have any ports
- look up
- Run SQL
- Scan
- Fuse
56) which of the following components does not require a sorted input
- match sorted
- join
- merge
- dedup sorted
57) which of the following components has an ordered input port.
- concatenate
- gather
- partition by key
- Reformat
58) The usage of which component/components is recommended when the incoming data records is unsorted and needs be joined with a small set of records.
- sort and join(input needs to be sorted or grouped)
- join(in memory, inputs need not be sorted or grouped)
- lookup file with reformat
- sort and match sorted
59) which of the following components requires a sorted input
- rollup
- join
- concatenate
- merge
60) delimiter “\001” refers to
- space delimiter
- NULL delimiter
- double space delimiter
- All of the above
61) what does varstring in the drop-down menu in the GDE’s Record Format Editor refer to
- variable length string
- fixed length string
- length prefixed string
- none of the above
62) Usage of which lookup function is recommended when one wants to check up whether a given key value exists in a serial lookup file
- lookup
- lookup_count
- lookup_count_local
- lookup_match
63) which of the following components does not have an ordered output port.
- partition by key
- concatenate
- interleave
- partition by Round Robin
64) .dbc files contain
- database backup configuration details
- database configuration details
- database credentials
- All of the Above
65) Lookup files are faster when
- small enough to store in memory; avoids I/O calls
- Too large to store in memory; avoids I/O calls
- No I/O calls
- None of the above
66) Phases are used to
- Logically divide work loads
- setting sequence of operation
- show different process separately
- A and C
67) .rec file is
- record changes to data
- used to recover point in time
- a recovery file
- None of the above
68) Which of the following are compound data types?
- vector
- union
- record
- All of the Above
69) A package is a collection of
- Functions and Variables
- Procedures and functions
- Procedures and variables
- None of the above
70) Which of the following are layout options?
- propagate from neighborhood
- component
- url
- All of the Above
71) Valid layout option?
- custom
- hosts
- database
- All of the Above
72) Can a resource in internet can be given as a layout?
- Yes
- No
- it has to be mentioned in dbc file
- None of the above
73) How do you set implicitly phases?
- By using staging
- By Using intermediate files
- By using logically divided work spaces
- All of the Above
74) Access tab of input/output table component can be used to
- truncate table
- collect error records
- collect error messages
- All of the Above
75) Replicate
- Non-partition
- makes a copy of records
- Both
- None of the above
76) The types of join are
- inner
- full outer
- explicit
- All of the Above
77) What is ramp and limit?
- error tolerance limit
- ramp is expressed in %
- limit is in no of records
- All of the Above
78) KEY Partitioning
- hash code is computed using the Key and same keys go to same partitions
- As many partitions as keys are created and same keys go to same partitions
- Both
- None of the above
79) Departitioning requires
- Fan in flow
- Fan out flow
- both
- None of the above
80) gather
- Not key based
- reads records as soon as they are available and order is unpredictable
- useful for repartitioning
- All of the Above
81) Merge
- result is sorted if each input is sorted
- result is sorted even if input is not sorted
- result is not sorted
- None of the above
82) Concatenate
- is used to add headers to records sets
- is used to add trailers to records sets
- used to bring non-parallel data flows together
- All of the Above
83) Layout is
- serial
- parallel
- Either
- None of the above
84) Layout
- determines what runs where
- propagates in all directions
- the computer/directory are unknown till runtime
- All of the Above
85) Which is a validate dml function?
- is_valid
- is_null
- is_defined
- All of the Above
86) max-core parameter is found in
- Sort
- Join
- Rollup
- All of the Above
87) Control filename is mandatory for
- DBC file
- Serial File
- MFS file
- Database
88) ab initio is case sensitive
- FALSE
- TRUE
89) Run SQL has ___ port(s)
- One input one output one log
- one log
- One Input One Output
- One Output One Log
90) Error tolerance limit is achieved through
- Ramp
- Limit
- Skew
- A or B
91) Lookup files
- Are small enough to store in memory
- Avoids I/O calls
- A & B
- B
92) ____ is a saving point so job can be restarted from that point case of crash
- Recovery file
- Check point
- DML
- XFR
93) Package is a collection of
- Vector
- Union
- Record
- Functions and Variables
94) Which is NOT a type of Parallelism in Ab Initio
- Pipeline Parallelism
- Process Parallelism
- Component Parallelism
- Data Parallelism
95) Which component will you find the parameter “Output Index”
- Rollup
- Reformat
- Join
- Filter by Expression
96) There is a 2-way multifile. Partition 0 has 3 records and partition 1 has 5 records. We connect this file to a broadcast component with multiple flows going out with the same depth. How many records will come in each flow?
- 20
- 8
- 16
- 10
97) What error will we get if the MAX CORE value is set to too low.
- Record out of order
- Paging by OS
- Too many open files
- Increase MAX CORE
98) Which of the following is NOT a valid command
- m_mv
- m_wc
- m_cat
- m_zip
99) What does “skew” mean
- Uneven distribution of data among partitions
- Choosing incorrect partition key
- Partitions are equally divided
- Balanced Data
100) If a parameter is defined in multiple project as following, which will be the correct order of precedence (from lowest to highest) (1) Stdenv (2)Localenv (3) Private Project
- 1,2,3
- 3,2,1
- 2,1,3
- 2,3,1
101) What is .abinitiorc file used for and its location
- Password file present in home directory
- User configuration file present in home directory
- User configuration file present in private project
- Database configuration file present in common project
102) Which is NOT TRUE about component folding?
- It reduces number of processes
- When component folding is enabled, Co Op automatically scans and folds as many components as it can
- We can specify some components to be excluded from the foldable list
- Decreases graph performance
103) Which of the following is NOT TRUE about Driving Input
- It is present only in in-memory join
- It is always present in a join component regardless of sorted-input
- Ideally largest input is taken as driving input
- All non-driving inputs are taken into memory
104) How to go from n_way to m_way partitioning?
- Departition
- Repartition
- Partition
- None of the above
105) Which air command is used to “checkin” a project through command line?
- air project export
- air project checkin
- air project import
- air project checkout
106) Variable used to set EME path
- AB_AIR_PATH
- AB_AIR_BRANCH
- AB_HOME
- AB_AIR_ROOT
107) What does MFS stands for?
- Multi File System
- Multi File Storage
- Maximum File Storage
- Minimum File Storage
108) Which component breaks pipeline parallelism?
- Reformat
- Sort
- Redefine Format
- Filter By Expression
109) What the difference is between reformat and redefine format?
- Reformat can actually change the bytes of the data whereas redefine format changes the record format
- Reformat changes the record format whereas redefine format changes the bytes of the data
- Both are used to change record format as well as bytes of the data
- There is no difference between Reformat and Redefine component
110) Which command is used to test “DML expression” quickly?
- m_dump
- m_touch
- m_eval
- m_expand
111) What are “NULLs” represent in DML?
- A zero-length string
- Unknown or missing piece of data
- Number zero
- A string containing all blanks
112) What is the difference between phasing and checkpoint?
- Phasing is used for recovery and checkpoint is used for memory management
- Both are used for managing data
- Phasing is used for memory management and checkpoint is used for recovery
- Phasing & Checkpoints both are same
113) How to create surrogate key ?
- Use Assign Key Component
- next_in_sequence()
- A&B
- using surrogate function
114) Command used to list tags on a particular object
- air tag show
- air tag ls
- air tag view
- air tag all
115) How can you find the depth of MFS
- m_eval
- m_wc
- m_expand
- m_dump
116) What are the files present in MFS directories which store the directory structure
- .mfctl
- .mdir
- .air-project-parameters
- .air-project-overrides
117) Which of the following statements in NOT true
- ROLLUP – Generates a final summary record rather than a running cumulative summary for the group.
- NORMALIZE – Optionally computes summary fields in the output record for each group.
- SCAN – Ggenerates an output record that includes a running cumulative summary for the group to which the input record belongs.
- Gather LogS – Collects the output from the log ports of components for analysis of a graph after execution.
118) Which of the following is NOT a valid m_db command
- m_db test
- m_db unload
- m_db gendml
- m_db del
119) Best way to change fixed length data to delimieted data
- Use Rollup component
- Use Scan Component
- Use Filter by Expression
- Use Redefine Component
120) Which of the below component can not separate a file with header/trailer and body records? The file has a record identifier.
- Filter by expression
- Reformat with Output Index
- Partition by Expression
- Partition By Key
121) Which component generates multiple output records from each of its input records.
- Normalize
- Denormalize
- Rollup
- Filter by Expression