Hadoop DPE Quiz
This Hadoop DPE quiz contains a set of 30 multiple-choice questions that will help you clear the beginner-level quiz.
1) Replicating data on multiple data nodes helps achieve which characteristic of Hadoop (HDFS)?
- High Availability
- Fault tolerance
- Both A and B
- None of the above
2) Hadoop ships the code to the data instead of sending the data to the code. What does this help achieve?
- Process large volumes of data
- Hardware efficiency
- Lesser storage
- None of the above
3) MapReduce can be used for
- Machine learning
- ETL
- Both A and B
- None of the above
4) What is the primary regulatory concern for cloud computing?
- Lack of transparency of cloud providers
- Data storage beyond national boundaries
- Lock-in by cloud providers
- None of the above
5) Disaster recovery in a cloud environment can be achieved efficiently through
- Replicated data centers
- Using multiple cloud providers
- Backing up data
- None of the above
6) On-demand self-service helps cloud services achieve
- Automation
- Resource pooling
- Reliability
- High utilization
7) Proprietary APIs for accessing cloud services result in
- Efficient usage
- Customization of services
- Data lock-in
- None of the above
8) Moving large volumes of data across cloud boundaries is not suitable when
- Using multiple cloud providers
- Using private clouds
- Cost is the primary consideration
- All of the above
9) Deployment automation can be achieved through
- Efficient algorithms
- Library of scripts (recipes)
- Effective monitoring
- None of the above
10) Internet latencies in a cloud environment make it necessary to keep static data closer to
- Compute servers
- End users
- Distributed equally between end users and servers
- None of the above
11) Consistency compromise is tolerated in
- Relational DB
- NoSQL DB
- VLDB
- All of the above
12) Algorithms which learn from and make predictions on data can be categorized as
- Machine learning
- Numerical
- Randomized
- None of the above
13) To take full advantage of horizontal scaling, the following can be used
- Partitions or shards
- Replication
- Proxies
- Global cache
14) Which of these properties is challenging to implement in an in-memory database?
- Atomicity
- Consistency
- Isolation
- Durability
15) Which feature is not a significant factor for a private cloud?
- Self service portal
- Metered billing
- Rapid elasticity
- Replication
16) In a ______ cloud, an organization might use a public cloud service, such as Amazon’s Elastic Compute Cloud (EC2) for general computing but store customer data within its own data center.
- Public
- Private
- Hybrid
- All of the above
17) What is not true about a private cloud?
- Has to be managed internally
- Operated solely for a single organization
- Provides hosted services behind a firewall
- Managed and hosted externally by a third party
18) The purpose of load balancing is
- Optimum resource usage
- Maximum throughput
- Minimize response time
- All of the above
19) Which best practices apply to a secure cloud environment?
- Protect data in transit
- Protect data at rest
- Protect credentials
- All of the above
20) Containers are more efficient than virtual machines for
- Porting applications
- System resource requirements
- OS usage
- None of the above
21) Hadoop can be run in which of the following modes
- Standalone
- Pseudo-distributed
- Fully distributed
- All of the above
22) Volume, velocity and variety are used to characterize
- Web applications
- Big Data
- Cloud applications
- Mobile Apps
23) If a node appears to be running slowly, the master node can redundantly execute another instance of the same task and take whichever output arrives first. This process is called
- Proactive scheduling
- Parallel processing
- Distributed processing
- Speculative execution
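The behaviour described in question 23 can be sketched in a few lines of Python. This is only an illustration, not how Hadoop implements it: the names `run_attempt` and `first_output` are hypothetical, threads stand in for cluster nodes, and a sleep stands in for the task's work.

```python
import concurrent.futures as cf
import time


def run_attempt(delay_seconds, node_name):
    """Hypothetical task attempt: sleeps to simulate work, then returns its output."""
    time.sleep(delay_seconds)
    return f"output from {node_name}"


def first_output(attempts):
    """Start every attempt of the same task and return the first output to arrive.

    `attempts` is a list of (delay_seconds, node_name) pairs.
    Assumption: the slower attempt is simply left to finish here; a real
    scheduler would typically cancel it instead.
    """
    with cf.ThreadPoolExecutor(max_workers=len(attempts)) as pool:
        futures = [pool.submit(run_attempt, delay, name) for delay, name in attempts]
        # as_completed yields futures in the order they finish.
        for future in cf.as_completed(futures):
            return future.result()


if __name__ == "__main__":
    # One attempt on a slow node, one redundant attempt on a healthy node.
    print(first_output([(0.3, "slow-node"), (0.02, "backup-node")]))
```

The point of the design is that the job's completion time is bounded by the fastest attempt rather than by the slowest node.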
24) Which of the following are the main aspects of architectural quality?
- Conceptual Integrity
- Buildability
- Correctness and Completeness
- All of the above
25) Which of the following scenarios makes HDFS unavailable?
- JobTracker failure
- TaskTracker failure
- DataNode failure
- NameNode failure
26) What are the drawbacks of a layered architecture?
- Information hiding
- It is often necessary to pass data through many layers, which can slow performance
- Both A and B
- None of the above
27) Which of these does not belong to the qualities of operational requirements?
- Portability
- Reuse
- Loose coupling
- All of the above
28) What are the techniques for selecting alternative requirements?
- Stakeholder participation
- Crucial expectation
- Multidimensional ranking
- Pros and cons
29) Which MapReduce stage serves as a barrier, where all previous stages must be completed before it may proceed?
- Shuffle
- Reduce
- Combine
- Write
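To see why a MapReduce pipeline needs a barrier at all, a single-process word count may help. This is a minimal sketch under stated assumptions, not Hadoop's API: the function names `map_phase`, `group_by_key`, `reduce_phase` and `word_count` are hypothetical, and everything runs in memory on one machine.

```python
from collections import defaultdict


def map_phase(lines):
    """Emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.split():
            yield word.lower(), 1


def group_by_key(pairs):
    """Collect all values for each key.

    This step has to consume every map output before any key's list is
    known to be complete, which is what makes it a barrier.
    """
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


def reduce_phase(groups):
    """Sum the collected values for each key."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(lines):
    return reduce_phase(group_by_key(map_phase(lines)))


if __name__ == "__main__":
    print(word_count(["big data", "Big cluster"]))
```

A count for a word cannot be final while any mapper might still emit that word, so the summing step cannot safely finish until all map output has been grouped.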
30) Which steps are included in use-case-driven iterative development?
- At each iteration, one or more use cases are selected for implementation
- Iteration should be followed until the system is complete
- Iterative development builds system functionality gradually through analysis, design, coding, testing and evaluation.
- All of the above