Q.1 Which of the following options are the characteristics of Sqoop?
1.transfer legacy systems into Hadoop.
2.parallel processing
A. Option 1
B. Option 2
C. Both of the options
D. None of the options
Ans : Both of the options
Q.2 Which one of the following options is the default database of Sqoop?
A. Hbase
B. Oracle
C. MySQL
D. None of the options
Ans : MySQL
Q.3 Which one of the following options is the first step of Sqoop?
A. Sqoop will generate JAVA classes using table schema and package them into a JAR file
B. MapReduce jobs begin to run
C. You need to use import command in Sqoop CLI
D. Imported data is saved on HDFS. After this step, Hadoop will send a response back to Sqoop cli to show to the user
Ans : You need to use import command in Sqoop CLI
Q.4 coupons.com uses ____________.
A. Flume
B. Oozie
C. Sqoop
D. Zookeeper
Ans : Sqoop
Q.5 Name the process in which the data is imported, transferred, loaded and processed for future use in a database.
A. Sqoop
B. Flume
C. Data Ingestion
D. Oozie
Ans : Data Ingestion
Q.6 Which one of the following operation can transform the stream?
A. Flume
B. Sqoop
C. Imphala
D. Zookeeper
Ans : Flume
Q.7 ___________ is a data-serialization framework.
A. Exec
B. Apache Avro
C. syslog
D. TCP
Ans : Apache Avro
Q.8 Which of the following options are the necessary steps regarding the configuration file?
A. Every source must have at least one channel
B. The component type should be specified
C. Every sink should have one channel
D. All of the options
Ans : All of the options
Q.9 There can be only one Flume agent.
A. True
B. False
Ans : False
Q.10 Flume uses a cross-platform operating system.
A. True
B. False
Ans : True
Q.11 Flume offers buffer storage for real-time spikes.
A. True
B. False
Ans : True
Q.12 Flume agent is a kind of JVM process.
A. True
B. False
Ans : True
Q.13 __________ is responsible for sending the event to the channel with which it is connected.
A. Source
B. sink
C. Both of the options
D. None of the options
Ans : sink
Q.14 Which of the following options are the different types of sink in Apache Flume?
A. File Roll
B. Logger
C. Null
D. All of the options
Ans : All of the options
Q.15 Which of the following option/options is/are the channel types in Apache Flume?
A. File Channel
B. JDBC Channel
C. Memory Channel
D. All of the options
Ans : All of the options
Q.16 __________ is the software framework to migrate data from the relational database to Hadoop system and vice versa.
A. Imphala
B. Sqoop
C. Data injestion
D. Flume
Ans : Sqoop
Q.17 <hduser> sqoop import ______________.
A. –username user
B. –connect jdbc:mysql://localhost/
C. –password root
D. None of the options
Ans : –connect jdbc:mysql://localhost/
Q.18 Which one of the following type of node shows the start and end of the workflow?
A. action Node
B. Control Flow Node
C. Workflow Node
Ans : Control Flow Node
Q.19 Which one of the following option is the workflow file format?
A. .txt
B. .html
C. .xml
D. None of the options
Ans : .xml
Q.20 Which of the following directory will save the configuration details of Oozie?
A. confg
B. Configuration
C. conf
D. None of the options
Ans : conf
Q.21 Oozie uses ___________ type of graph.
A. cyclic
B. infinite
C. acyclic
D. None of the options
Ans : acyclic
Q.22 The Oozie workflow should have the __ node.
A. end
B. start
C. exit
D. resume
Ans : start
Q.23 Oozie is still using a parameter called _______ to identify the YARN arguments as they are yet to create a new parameter for YARN itself.
A. NameNode
B. Secondary NameNode
C. JobTracker
D. All of the options
Ans : JobTracker
Q.24 __________ is a server-based workflow scheduling system to manage Hadoop jobs.
A. Flume
B. Sqoop
C. Oozie
D. None of the options
Ans : Oozie
Q.25 Which of the following options are characteristics of Oozie?
A. schedule jobs as well as reschedule jobs on failure
B. Allows creating DAG of workflows
C. It simplifies workflow and coordination between jobs
D. All of the options
Ans : All of the options
Q.26 __________ allows users to schedule complex workflows.
A. Oozie Coordinator
B. Oozie Bundles
C. Oozie Workflow
Ans : Oozie Coordinator
Q.27 Which of the following functions does flume support?
A. Syslog
B. Netcat
C. Avro
D. All of the options
Ans : All of the options
Q.28 _______ provides logic between action nodes like start, end and kill.
A. Workflow
B. Control Nodes
C. Both of the options
D. None of the options
Ans : Control Nodes
Q.29 Which of the following options is important for multifunction Flume agents?
A. Multiple channels
B. Multiple Sources only
C. Multiple Sinks only
D. Multiple Sinks and Sources
Ans : Multiple Sinks and Sources
Q.30 We use the reducer to write out the SQL import files in Sqoop.
A. True
B. False
Ans : True
Q.31 The _________ node drives the actions of the workflow while the __________ node is a specific executable tied to a function.
A. control, workflow
B. action, control
C. control, action
D. None of the options
Ans : control, action
Q.32 Which Avro functions does flume set?
A. Event
B. Sink
C. Source
D. Agent
Ans : Source
Q.33 Which of the following options should be kept in mind before exporting data with reducers?
A. Sqoop exports from HDFS do not provide the location of source files
B. Sqoop exports have a complete understanding of the data types
C. The correct key values should be assigned to each reducer during the partitioning
D. None of the options
Ans : The correct key values should be assigned to each reducer during the partitioning
Q.34 The ______ channel behaves very similarly to the file channel.
A. Memory channel
B. JDBC Channel
C. Flume channel
Ans : JDBC Channel
Q.35 Which of the following options is the final step while loading data into HDFS using Apache Flume?
A. Configuring the sink
B. Configuring the source
C. Building the source and sink to the channel
D. Building the sink and channel to the source
Ans : Building the source and sink to the channel
Q.36 Which of the following are applicable to Sqoop?
A. created by Hortonworks
B. Uses a GUID interface
C. Designed to connect to MapReduce and to launch MapReduce jobs
D. None of the options
Ans : Designed to connect to MapReduce and to launch MapReduce jobs
Q.37 Each row in the table is taken as a _______ in HDFS.
A. record
B. input
C. row
D. data
Ans : record
Q.38 _______ tag in action node signifies that you can make the transition to the next node.
A. error
B. ok
C. None of the options
Ans : ok
Q.39 Which of the following options are Sqoop’s characteristics?
A. It creates one-to-many MapReduce jobs to perform the export or import task
B. It is efficient at determining the number of mappers required to import data
C. It is a client program
D. All of the options
Ans : All of the options
Q.40 ____________ can be MapReduce jobs, Java/Pig application, Hive, etc.
A. Action Nodes
B. Control Flow Node
C. None of the options
D. Both of the options
Ans : Action Nodes
Q.41 Sqoop will use a column to split. If that column not present then it will use primary key column.
A. True
B. False
Ans : True
Q.42 ___________ is used to run multiple jobs in parallel.
A. Join
B. Combine
C. Distribute
D. Fork
Ans : Fork
Q.43 What would be the correct step after Flume and Flume agent is installed?
A. Configure the working directory
B. Make a symlink
C. Configure the configuration directory
D. None of the options
Ans : Make a symlink
Q.44 ________ monitors status of coordinator jobs.
A. Oozie Workflow
B. Oozie Coordinator
C. Oozie Bundles
Ans : Oozie Bundles
Q.45 _____________ are the communication and retention mechanism that manage event delivery.
A. Sink
B. Channel
C. Source
D. None of the options
Ans : Channel
Q.46 Why is Sqoop highly dependent upon the primary keys of the RDBMS?
A. There is no work
B. The lack of primary numeric key will increase the likelihood of undesired results
C. The primary key will speed up indexing
D. None of the options
Ans : The primary key will speed up indexing
Q.47 ____________ removes the events from channels and stores it into an external repository like HDFS.
A. Source
B. Sink
C. Both of the options
D. None of the options
Ans : Sink