Ab Initio Intermediate Quiz
1) Which other component can be used to find the First/Last record from a record group other than Dedup Sorted?
- Denormalize_sorted
- Rollup
- Read Multiple Files
- Fuse
2) Read Multiple Files reads randomly from a list of input files
- True
- FALSE
3) Self-sized vectors can be initialized only by using a loop expression
- True
- FALSE
4) Round-robin partitioning is a technique that distributes the data non-uniformly to each of the destination data partitions.
- True
- FALSE
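For reference while thinking about round-robin distribution, here is a minimal Python sketch of dealing records out in turn. It is not Ab Initio code: the function name `round_robin_partition` is hypothetical, and dealing one record at a time is an assumption of the sketch.

```python
def round_robin_partition(records, n_partitions):
    """Deal records out one at a time, cycling through the partitions."""
    partitions = [[] for _ in range(n_partitions)]
    for index, record in enumerate(records):
        # Record 0 goes to partition 0, record 1 to partition 1, and so on,
        # wrapping around after the last partition.
        partitions[index % n_partitions].append(record)
    return partitions


parts = round_robin_partition(list(range(10)), 3)
sizes = [len(p) for p in parts]
print(sizes)
```

With this scheme the partition sizes can differ by at most one record, whatever the record contents are.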
5) m_db_test is used to test a dbc file from the command prompt
- True
- FALSE
6) How many types of parallelism exist in Ab Initio
- 3
- 2
- 1
- None of the above
7) Which component is used to join two files with different layouts
- Join
- Join with DB
- Fuse
- None of the above
8) How can one reference a value in one component that was created in a different component
- Pass the value in the data stream
- Write the value to a lookup file and look up the value in the downstream component.
- Access the needed value using a script that is the value of a graph parameter you create
- All of the above
9) Which component does not work in pipeline parallelism
- Reformat
- Rollup
- Sort
- Both b & c
10) Defining phases in Ab Initio means
- To provide the means to restart a failed graph from some intermediate state
- To limit the number of simultaneous processes by breaking up the graph
- Performance tuning by managing resources
- Both b & c
11) Which of the following typecasts converts x to type double and then divides by 100
- Double(x/100)
- x/100 (double)
- ((double) x) / 100
- Both b and c
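The order of "convert" and "divide" can matter whenever the division is carried out on integers. The following is a Python illustration only, not DML; how DML itself evaluates the division depends on the type of x and is not reproduced here. The variable names are hypothetical.

```python
x = 250

# Divide as integers first, then convert: the fraction is already lost.
divided_then_converted = float(x // 100)

# Convert first, then divide: the fraction is kept.
converted_then_divided = float(x) / 100

print(divided_then_converted, converted_then_divided)
```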
12) From an input dataset of, say, 200 records, which component would you use to get records 80 through 125
- scan and filter by component
- m_dump
- next_in_sequence() function and filter by
- All of the above
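Whichever component is chosen, the underlying idea is to number the records as they arrive and keep only those whose number falls in the wanted range. A minimal Python sketch of that idea follows; it assumes 1-based numbering and an inclusive range, and `select_range` is a hypothetical helper, not an Ab Initio function.

```python
def select_range(records, first, last):
    """Keep records whose 1-based position lies in [first, last]."""
    selected = []
    for sequence_number, record in enumerate(records, start=1):
        if first <= sequence_number <= last:
            selected.append(record)
    return selected


records = [f"rec{n}" for n in range(1, 201)]  # 200 sample records
subset = select_range(records, 80, 125)
print(len(subset), subset[0], subset[-1])
```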
13) Dividing a single flow of records (data) into multiple flows is known as
- partitioning
- re-partitioning
- departitioning
- None of the above
14) Dividing an x-way flow of records (data) into a y-way flow is known as
- departitioning
- partitioning
- re-partitioning
- None of the above
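To make the x-way to y-way idea concrete, here is a small Python sketch that takes records already spread over four partitions and redistributes them over two. It is an illustration under stated assumptions, not Ab Initio code: the helper name `redistribute` and the use of a key modulo the target partition count are choices made for the sketch.

```python
def redistribute(partitions, n_out, key):
    """Move every record from the input partitions into n_out new partitions."""
    out = [[] for _ in range(n_out)]
    for partition in partitions:
        for record in partition:
            # The destination depends only on the record's key,
            # not on which input partition it came from.
            out[key(record) % n_out].append(record)
    return out


four_way = [[0, 4], [1, 5], [2, 6], [3, 7]]
two_way = redistribute(four_way, 2, key=lambda record: record)
print(two_way)
```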
15) Which of the following is true
- Replicate is for non-partition
- broadcast – partition
- both make copies of records
- All of the above
16) ab_report is used to
- get statistics
- get report format
- All of the above
- None of the above
17) Can a resource on the internet be given as a layout
- no
- yes
- resource must be local
- None of the above
18) What are the different connection methods
- ssh
- telnet
- rexec
- All of the above
19) Which is true about lookup files
- increases I/O calls
- lookup is faster only if file is small enough to store in memory
- join component is better than large lookup files
- None of the above
20) Which of the following is true with regards to access privileges of dataset components
- access privileges cannot be set to dataset components
- access privileges can be set to dataset components
- there is no access privilege for dataset components
21) How do you set phases implicitly
- check the phase checkbox
- uncheck the phase checkbox
- use intermediate files
- All of the above
22) Which command is used to test dml expressions
- m_dump
- m_eval
- All of the above
- None of the above
23) How do you generate table DML from the command line
- m_eval
- m_db gendml
- gendml
- None of the above
24) Which command is used to clean up files left over from unsuccessfully executed graphs and manually recovered graphs?
- m_rollback
- m_cleanup
- None of the above
- All of the above
25) Which command is used to remove watcher files
- rm
- m_rm
- m_cleanup
- None of the above
26) What are the Dedup criteria?
- Unique only
- First
- Last
- All of the above
27) A configuration file contains the following information:
- The name and version number of the database to which you want to connect.
- The name of the computer on which the database instance or server to which you want to connect runs, or on which the database remote access software is installed.
- The name of the database instance, server, or provider to which you want to connect.
- All of the above
28) For which join type is the record-required parameter used
- inner join
- outer join
- explicit join
- All of the above
29) How is reject-threshold calculated using limit and ramp
- ramp * limit + number of records processed so far
- ramp + (limit * number of records processed so far)
- limit + (ramp * number of records processed so far)
- limit/ramp + number of records processed so far
30) Which component produces intermediate summary records while aggregating
- Rollup
- Scan
- Aggregate
- All of the above
31) Using which component can we specify the rate of data movement from input to output
- Throttle
- Fuse
- Scan
- Sample
32) Which of the following components would you use to parse data programmatically
- run program
- read raw
- reformat
- run sql
33) Which of the following has a deselect port
- Filter by expression
- Join
- sort
- reformat
34) What does a redefine component do?
- It changes the value of the data records according to expression
- It renames the fields in record format and changes its value according to expression
- It renames the fields in the record format without changing its value
- It copies the data records from its input to output record format and changes its values according to key expression.
35) Which one of the following is a miscellaneous component
- broadcast
- merge
- trash
- denormalize
36) What is the maximum value allowed for max-core
- 2^31 - 1
- 2^21 - 1
- 2^11 - 1
- no limit
37) In which way do conflicts arise during check-in?
- The graph in the sandbox is not the latest version
- The graph check-in happens in a different project
- Both of the above
- Conflicts do not arise
38) How can an explicit join perform an inner join?
- Record Required Parameter For Both Ports Are Set To False
- Record Required Parameter For Both Ports Are Set To True
- Record Required Parameter For Port 0 Is True And For Port 1 Is False
- Record Required Parameter For Port 0 Is False And For Port 1 Is True
39) Which of the following is incorrect
- Gzip reduces the volume of data in flow
- Compress reduces the volume of data in flow in UNIX platform
- Gunzip reduces the volume of data in flow
- Gunzip reverses the effect of compress.
40) Which component is more powerful and easier to use than the aggregate component
- Rollup
- Join
- Sort within groups
- None of the above
41) Which components require sorted input?
- Rollup
- Join
- Dedup Sorted
- All of the above
42) The component stops the execution of the graph if the number of reject events exceeds the result of the formula:
- limit + (ramp * number_of_records_processed_so_far)
- limit * (ramp + number_of_records_processed_so_far)
- limit+ramp
- limit*ramp
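As a worked illustration of how limit and ramp interact, the Python sketch below assumes the threshold is a fixed allowance (limit) plus a per-record allowance (ramp) multiplied by the number of records processed so far. The function names and sample values are hypothetical; this is not Ab Initio code.

```python
def reject_threshold(limit, ramp, records_processed):
    """Assumed form: fixed allowance plus a per-record allowance."""
    return limit + ramp * records_processed


def should_abort(reject_count, limit, ramp, records_processed):
    """The run stops only once rejects exceed the current threshold."""
    return reject_count > reject_threshold(limit, ramp, records_processed)


# Hypothetical settings: limit = 10, ramp = 0.5, 100 records processed so far.
threshold = reject_threshold(10, 0.5, 100)
print(threshold)
print(should_abort(60, 10, 0.5, 100), should_abort(61, 10, 0.5, 100))
```

Under this assumption, a ramp of 0 makes the threshold a flat count, while a non-zero ramp lets the tolerated number of rejects grow with the volume processed.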
43) What does a watcher do?
- Eliminating To Do Cues
- Turn on the debugging mode
- Displays flow patterns
- Edit Key specifiers
44) Denormalize sorted
- requires grouped input
- consolidates groups of related data records into a single output record
- generates a vector field for each group and optionally computes the summary field in the output record
- All of the above
45) What is a summary file?
- A file containing information about flows, components, and phases during the most recent run.
- A file containing the summary of the graph run
- All of these
- none of these
46) What is true about a multifile?
- it is a file that is divided into many partitions, all stored on the same disk
- a multifile contains one multifile and 2 data partitions
- a multifile contains one control file and one or many data partitions
- round-robin partitioning is not used to partition data in a multifile
47) What does a Broadcast component do?
- It arbitrarily combines all the data records it receives into a single flow and writes a copy of that flow to each of its output flow partitions.
- It appends multiple flow partitions of data records one after the other.
- It echoes the final status of the graph run
- none of these
48) Component used to create surrogate keys?
- Assign Key Component
- Fuse
- Run Program
- Sample
49) The parameter used to specify the path containing SAS executables
- AB_SASHOME
- AB_SAS_CONTROL_FILE
- AB_SAS_AUTOEXEC
- None of the above
50) All packages have the extension
- .xfr
- .pkg
- .mpc
- .mdc
51) What is false about API mode?
- ablocal_expr parameter is available only when dbms_interface is set to api.
- The expression is evaluated once per partition
- cannot be used for fanout specific components
- first and third
52) How can a graph be changed so that it consumes less CPU
- Use partitions
- If a transformation has many local variables, make them global
- Reduce the degree of parallelism
- 3rd and 4th
53) If Ab Initio is installed in a UNIX environment, the .mp file is converted internally to
- .ksh file
- .xfr file
- .dml file
- .sh
54) What is the default name of a component parameter when it is exported
- parameter name.component name
- component name_parameter name
- parameter name_component name
- component name.parameter name
55) Which type of SQL statement activity should be avoided most when a parallel unload from a table is performed
- group by
- joins
- both A and B
- none
56) When using a macro, which of the following relating to that macro become runtime parameters
- components
- flows
- parameters
- all the above
57) The value of the AB_MAX_CORE parameter should be set based on
- memory usage per partition
- memory usage over all partitions
- disk space usage for each partition
- disk space usage over all partitions
58) Deadlocks occur because of
- no automatic flow buffering
- partitioning and departitioning occur in the same phase
- both A and B
- none
59) string_concat('string', NULL) in the expression editor will result in
- string
- NULL
- compile time error
- run time error
60) Which of these components has the number of partitions as a required parameter
- partition by range
- recirculate
- assign keys
- normalize
61) Which of the following can be a state of a file in the sandbox in relation to its datastore
- unresolved
- stale
- current
- all the above
62) A file in your sandbox is in the conflict state and the conflict has been deferred (state-unresolved modified). Now the file version in the data store has changed again because someone else has checked in. The state of your file is unresolved (conflict). No
- conflict
- unresolved
- stale
- current
63) In which type of predefined sequence specifier are punctuation characters ignored
- index
- phonebook
- machine
- none
64) For which of the following does DML allow four bytes of storage
- packed decimal(8,3,unsigned)
- packed decimal(5)
- packed decimal(4,stripped)
- packed decimal(6,unsigned)
65) Which keyword should be used to make a custom-type sequence specifier treat a group of characters as equivalent to each other
- foldcase
- ignore
- fold
- ignorecase
66) Subgraph is used when
- a single executable has to be created
- the number of components and their arrangement are constant from one run of the graph to another
- the components, the number of components, and their arrangement differ from one run of the graph to another
- none
67) Checkpoint
- is a saving point so job can be restarted from that point in case of crash
- is a saving point so job can be restarted from last processed record in case of crash
- is a saving point so job can be restarted from beginning in case of crash
- None of the above
68) An xfr file contains
- user written functions for reusability
- user written procedures for reusability
- user written database calls for reusability
- None of the above
69) Skew is caused by
- Phases
- checkpoints
- load imbalance
- None of the above
70) The following are m_ commands
- m_mkdir
- m_rollback
- m_cleanup
- All of the Above
71) What happens to main graph when subgraph is updated?
- it will not be updated
- it will be updated
- It will be recompiled
- None of the above
72) Can access privileges be set on dataset components?
- Yes
- No
- depends on the layout
- None of the above
73) Broadcast
- for partition
- makes a copy of records
- Both
- None of the above
74) Checkpointed sort
- creates small files and then finally sorts
- sorts partitioned datasets
- both
- None of the above
75) Update SQL component can have “delete” SQL statements
- Yes
- No
- if there are no updates
- None of the above
76) Repartition is
- partition
- departition
- partition and departition
- None of the above
77) What is force_error used for?
- to artificially crash the graph for testing or specific alarm raising
- to catch exceptions for unhandled condition
- to capture error records in error port
- All of the Above
78) get_flow_state is used for
- getting error code
- status updates
- Gets the position of a record in a flow
- All of the Above
79) A key specifier is
- primary key parameter to fetch the required columns
- give the dataset key
- give the Db key for connection
- None of the above
80) ordered attribute
- is sorting records
- maintains the sequence of joining multiple flows
- is used for partitioning
- None of the above
81) Fan has the following attributes
- more than one flow can connect
- 2 flows can connect
- single flow can join
- None of the above
82) split string is used
- to feed into sort component
- to feed into the 'Partition by Range' component
- both A & B
- None of the above
83) std project env contains
- private project
- public project
- environment project
- All of the Above
84) Dead lock
- When two or more processes request same resource
- can be avoided by using phasing
- can be avoided using resource pool
- All of the Above
85) which are multistage components
- scan
- normalize
- denormalize
- All of the Above
86) Multistage components
- are transform components
- records are transformed in stages
- both A & B
- None of the above
87) different partitioners are
- ROUND ROBIN
- KEY
- Both
- None of the above
88) Flow buffers
- are automatically added by GDE
- prevent deadlocking
- can block on read and write
- All of the Above
89) Repartition when
- Key changes
- you want to group records within the partitions of partitioned data
- you want to redistribute to fewer / more number of partitions
- All of the Above
90) Dynamic DML is used when
- different input files with different DML are received for processing
- the DML contains a flag; the flag is read first from the received input file, and the corresponding DML is used according to the flag
- Both A & B
- Dynamic DML is not possible in ab initio
91) Local Parameter
- needs to be initialized on declaration
- is passed at run time
- is same as formal parameter
- None of the above
92) Which one of the following cannot be a layout option
- Component
- URL
- Hosts
- DML
93) Load imbalance in Ab Initio is represented as a measure of
- Checkpoint
- Vector
- Skew
- MFS
94) If a main graph uses many predeveloped graphs as subgraphs, updating the subgraphs
- Will automatically update main graph
- Needs Manual Updating Of Sub Graphs In Main Graph
- Sub Graphs Can Not Be Edited Individually
- None Of The Above
95) Find the odd one out
- Vector
- Union
- Record
- Integer
96) If an input file contains CustomerId, DateOfPurchase, and AmtOfPurchase, then AmountSpentTillDate can be calculated for each customer using
- Rollup
- Aggregate
- Scan
- Sum
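The quantity being asked about is a running (cumulative) total per customer: one output value for every input record, not one per customer. A minimal Python sketch of that calculation follows. It assumes the records arrive in purchase-date order within each customer; the function name `running_totals` and the sample rows are hypothetical.

```python
from collections import defaultdict


def running_totals(purchases):
    """Attach the amount spent so far by that customer to each purchase."""
    spent_so_far = defaultdict(int)
    output = []
    for customer_id, date_of_purchase, amount in purchases:
        spent_so_far[customer_id] += amount
        # One output row per input row, carrying the cumulative amount.
        output.append(
            (customer_id, date_of_purchase, amount, spent_so_far[customer_id])
        )
    return output


rows = [
    ("C1", "2024-01-01", 10),
    ("C1", "2024-01-05", 15),
    ("C2", "2024-01-02", 7),
]
result = running_totals(rows)
print([row[3] for row in result])
```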
97) If a data flow from 4 CPUs needs to be connected to a 16-CPU flow,
- Repartition components should be used
- Aggregate component should be used
- FTP components should be used
- None Of The Above
98) Compared with Oracle, the Input Table component for a DB2 table contains
- No Error port
- All the ports like oracle table
- No log port
- No output port
99) A depth error is thrown when
- Different layouts are connected
- Syntax Errors
- Error port contains no messages
- Log port contains no messages
100) To artificially crash a graph on some condition ________ is used
- force_error
- get_flow_state
- ab_report
- Split String
101) Intermediate files w.r.t phase
- Does not change phase
- Implicitly Changes The Phase
- Needs The Phase To Be Manually Set
- None Of The Above
102) You can truncate a table using the Access tab of a Table component
- Not possible
- Possible
- Need to write SQLs
- Need to write Unix scripts
103) summary.dml is used
- For reading the raw contents of summary file
- As DML In Aggregation Components
- Execution Of Jobs
- For Building Automatic Dmls
104) ABLOCAL construct is used in
- SQL statements
- XFR functions
- FTP commands
- Host settings
105) Which is False here about max core
- It has no single, optimal value
- Found in Sort, Join and Rollup
- Found in Input, Output and Intermediate Datasets
- Depends upon graphs and environment
106) Find the odd one out in the below components
- Load Annotation
- Remove objects and annotation
- Load MIMEobj
- FUSE
107) Which is False about Conditional component
- Korn Shell only can be used
- C Shell can only be used
- String evaluation should return True for a component to be enabled
- Can be edited in File->Preferences or Settings->Preferences
108) db_nodes in .dbc file means
- Database name
- Machine In Which Oracle Client Runs
- Oracle Home
- Oracle Instance
109) Identify the right syntax for referencing an environmental variable in a dbc file
- db_version: $MY_DB_VERSION
- db_version: ${MY_DB_VERSION}
- db_version: MY_DB_VERSION
- db_version: {MY_DB_VERSION}
110) m_db create myconfig.dbc -dml_file my.dml -table mytable is used for
- Creates dbc file
- Creates DML
- Creates Table
- Creates Oracle Instance
111) Which is false regarding flow-state variable
- Facilitates Detection Of Position Of Record In A Flow
- Each Partition Of Flow Has Flow-State
- Flow State Has Initial Value Of 0
- It Cannot Be Used To Define Conditional Record Formats
112) Dependency analysis is not carried out on these files
- Directly specified files
- Required files
- Dependent files
- Files yet to be checked in
113) ___ substitution for parameters is used for values that contain $ as a character, such as names in database tables
- $ Substitution
- ${} substitution
- Constant
- Shell
114) Which is false about m_env
- This command does not exist
- Is Used For Querying Environment Variable
- Can Debug Configuration Variable
- Can Give Ab Initio’s Version
115) When the join type is set to inner and one of the dedup parameters is set not to deduplicate, ____ is achieved
- Semi Join
- Compilation Error
- Outer Join
- Soft Join
116) execute_on_miss is used when
- All Rows Are Selected
- SQL compilation error
- SQL returns no rows
- SQL connection fails
117) Find the odd one out in SFTP_TO component parameters
- extra-args
- SFTP-Script
- output-tolerated
- Transfer-Encryption
118) Lookup functions, when given null arguments to use as a key,
- Will Attempt To Match Null Values To Corresponding Key
- Never Return Null
- Will Error Out
- Help Will Be Provided
119) Which is false about Compiled transforms
- Does Not Exist
- Improves Performance
- set in Run>Settings
- Does Compilation When Running Or Deploying Graphs
120) Which is not a reserved word here
- Little
- Bit
- ieee
- Let
121) Which is not true about global variable
- Persists For Life Of Components That Includes The Package Where The Global Variable Is Declared
- Each Component Partition Has Its Own Private Copy Of Each Global Variable Declared For That Component
- Accesses All Transform Functions In A Package
- Rules, Expressions And Statements For A Package Transform Function Can Not Read And Can Not Set Value Of Global Variables
122) m_db gendml mydb.dbc -table 'a.mytable' > mytable.dml
- Creates Dbc File
- CREATES DML
- Creates Table
- Creates Oracle Instance
123) Which is False about Macros
- It contains an .mpc and .ksh
- Uses mp commands to build Ksh
- Is Same As Custom Component
- Layout And Metadata Are Special Parameters In A Macro
124) Round Robin Partitioning
- Leads to an even distribution of records across all partitions
- cannot be used where there is group based partitioning
- Both
- None of the above