---Advertisement---
Apache Flume Apache Oozie Apache Sqoop Modern Data Platforms

Sqoop, Flume and Oozie Interview Question-Answer Part – 1

By smart_answer13

Updated on:

---Advertisement---

Q.1 Which of the following options are the characteristics of Sqoop?

1.transfer legacy systems into Hadoop.

2.parallel processing

       A. Option 1

       B. Option 2

       C. Both of the options

       D. None of the options

Ans : Both of the options


Q.2 Which one of the following options is the default database of Sqoop?

       A. Hbase

       B. Oracle

       C. MySQL

       D. None of the options

Ans : MySQL


Q.3 Which one of the following options is the first step of Sqoop?

       A. Sqoop will generate JAVA classes using table schema and package them into a JAR file

       B. MapReduce jobs begin to run

       C. You need to use import command in Sqoop CLI

       D. Imported data is saved on HDFS. After this step, Hadoop will send a response back to Sqoop cli to show to the user

Ans : You need to use import command in Sqoop CLI


Q.4 coupons.com uses ____________.

       A. Flume

       B. Oozie

       C. Sqoop

       D. Zookeeper

Ans : Sqoop


Q.5 Name the process in which the data is imported, transferred, loaded and processed for future use in a database.

       A. Sqoop

       B. Flume

       C. Data Ingestion

       D. Oozie

Ans : Data Ingestion


Q.6 Which one of the following operation can transform the stream?

       A. Flume

       B. Sqoop

       C. Imphala

       D. Zookeeper

Ans : Flume


Q.7 ___________ is a data-serialization framework.

       A. Exec

       B. Apache Avro

       C. syslog

       D. TCP

Ans : Apache Avro


Q.8 Which of the following options are the necessary steps regarding the configuration file?

       A. Every source must have at least one channel

       B. The component type should be specified

       C. Every sink should have one channel

       D. All of the options

Ans : All of the options


Q.9 There can be only one Flume agent.

       A. True

       B. False

Ans : False


Q.10 Flume uses a cross-platform operating system.

       A. True

       B. False

Ans : True


Q.11 Flume offers buffer storage for real-time spikes.

       A. True

       B. False

Ans : True


Q.12 Flume agent is a kind of JVM process.

       A. True

       B. False

Ans : True


Q.13 __________ is responsible for sending the event to the channel with which it is connected.

       A. Source

       B. sink

       C. Both of the options

       D. None of the options

Ans : sink


Q.14 Which of the following options are the different types of sink in Apache Flume?

       A. File Roll

       B. Logger

       C. Null

       D. All of the options

Ans : All of the options


Q.15 Which of the following option/options is/are the channel types in Apache Flume?

       A. File Channel

       B. JDBC Channel

       C. Memory Channel

       D. All of the options

Ans : All of the options


Q.16 __________ is the software framework to migrate data from the relational database to Hadoop system and vice versa.

       A. Imphala

       B. Sqoop

       C. Data injestion

       D. Flume

Ans : Sqoop


Q.17 <hduser> sqoop import ______________.

       A. –username user

       B. –connect jdbc:mysql://localhost/

       C. –password root

       D. None of the options

Ans : –connect jdbc:mysql://localhost/


Q.18 Which one of the following type of node shows the start and end of the workflow?

       A. action Node

       B. Control Flow Node

       C. Workflow Node

Ans : Control Flow Node


Q.19 Which one of the following option is the workflow file format?

       A. .txt

       B. .html

       C. .xml

       D. None of the options

Ans : .xml


Q.20 Which of the following directory will save the configuration details of Oozie?

       A. confg

       B. Configuration

       C. conf

       D. None of the options

Ans : conf


Q.21 Oozie uses ___________ type of graph.

       A. cyclic

       B. infinite

       C. acyclic

       D. None of the options

Ans : acyclic


Q.22 The Oozie workflow should have the __ node.

       A. end

       B. start

       C. exit

       D. resume

Ans : start


Q.23 Oozie is still using a parameter called _______ to identify the YARN arguments as they are yet to create a new parameter for YARN itself.

       A. NameNode

       B. Secondary NameNode

       C. JobTracker

       D. All of the options

Ans : JobTracker


Q.24 __________ is a server-based workflow scheduling system to manage Hadoop jobs.

       A. Flume

       B. Sqoop

       C. Oozie

       D. None of the options

Ans : Oozie


Q.25 Which of the following options are characteristics of Oozie?

       A. schedule jobs as well as reschedule jobs on failure

       B. Allows creating DAG of workflows

       C. It simplifies workflow and coordination between jobs

       D. All of the options

Ans : All of the options


Q.26 __________ allows users to schedule complex workflows.

       A. Oozie Coordinator

       B. Oozie Bundles

       C. Oozie Workflow

Ans : Oozie Coordinator


Q.27 Which of the following functions does flume support?

       A. Syslog

       B. Netcat

       C. Avro

       D. All of the options

Ans : All of the options


Q.28 _______ provides logic between action nodes like start, end and kill.

       A. Workflow

       B. Control Nodes

       C. Both of the options

       D. None of the options

Ans : Control Nodes


Q.29 Which of the following options is important for multifunction Flume agents?

       A. Multiple channels

       B. Multiple Sources only

       C. Multiple Sinks only

       D. Multiple Sinks and Sources

Ans : Multiple Sinks and Sources


Q.30 We use the reducer to write out the SQL import files in Sqoop.

       A. True

       B. False

Ans : True


Q.31 The _________ node drives the actions of the workflow while the __________ node is a specific executable tied to a function.

       A. control, workflow

       B. action, control

       C. control, action

       D. None of the options

Ans : control, action


Q.32 Which Avro functions does flume set?

       A. Event

       B. Sink

       C. Source

       D. Agent

Ans : Source


Q.33 Which of the following options should be kept in mind before exporting data with reducers?

       A. Sqoop exports from HDFS do not provide the location of source files

       B. Sqoop exports have a complete understanding of the data types

       C. The correct key values should be assigned to each reducer during the partitioning

       D. None of the options

Ans : The correct key values should be assigned to each reducer during the partitioning


Q.34 The ______ channel behaves very similarly to the file channel.

       A. Memory channel

       B. JDBC Channel

       C. Flume channel

Ans : JDBC Channel


Q.35 Which of the following options is the final step while loading data into HDFS using Apache Flume?

       A. Configuring the sink

       B. Configuring the source

       C. Building the source and sink to the channel

       D. Building the sink and channel to the source

Ans : Building the source and sink to the channel


Q.36 Which of the following are applicable to Sqoop?

       A. created by Hortonworks

       B. Uses a GUID interface

       C. Designed to connect to MapReduce and to launch MapReduce jobs

       D. None of the options

Ans : Designed to connect to MapReduce and to launch MapReduce jobs


Q.37 Each row in the table is taken as a _______ in HDFS.

       A. record

       B. input

       C. row

       D. data

Ans : record


Q.38 _______ tag in action node signifies that you can make the transition to the next node.

       A. error

       B. ok

       C. None of the options

Ans : ok


Q.39 Which of the following options are Sqoop’s characteristics?

       A. It creates one-to-many MapReduce jobs to perform the export or import task

       B. It is efficient at determining the number of mappers required to import data

       C. It is a client program

       D. All of the options

Ans : All of the options


Q.40 ____________ can be MapReduce jobs, Java/Pig application, Hive, etc.

       A. Action Nodes

       B. Control Flow Node

       C. None of the options

       D. Both of the options

Ans : Action Nodes


Q.41 Sqoop will use a column to split. If that column not present then it will use primary key column.

       A. True

       B. False

Ans : True


Q.42 ___________ is used to run multiple jobs in parallel.

       A. Join

       B. Combine

       C. Distribute

       D. Fork

Ans : Fork


Q.43 What would be the correct step after Flume and Flume agent is installed?

       A. Configure the working directory

       B. Make a symlink

       C. Configure the configuration directory

       D. None of the options

Ans : Make a symlink


Q.44 ________ monitors status of coordinator jobs.

       A. Oozie Workflow

       B. Oozie Coordinator

       C. Oozie Bundles

Ans : Oozie Bundles


Q.45 _____________ are the communication and retention mechanism that manage event delivery.

       A. Sink

       B. Channel

       C. Source

       D. None of the options

Ans : Channel


Q.46 Why is Sqoop highly dependent upon the primary keys of the RDBMS?

       A. There is no work

       B. The lack of primary numeric key will increase the likelihood of undesired results

       C. The primary key will speed up indexing

       D. None of the options

Ans : The primary key will speed up indexing


Q.47 ____________ removes the events from channels and stores it into an external repository like HDFS.

       A. Source

       B. Sink

       C. Both of the options

       D. None of the options

Ans : Sink


smart_answer13

---Advertisement---

Related Post

Apache Oozie Interview Question-Answer

Q.1 Which of the following is true about oozie?        A. Oozie is an Open Source        B. Oozie is available under Apache license ...

Apache Flume Interview Question-Answer

Q.1 Apache Flume 1.3.0 is the fourth release under the auspices of Apache of the so-called ________ codeline.        A. NG        B. ND ...

Apache Sqoop Interview Question-Answer

Q.1 Data can be imported in maximum ______ file formats.        A. 1        B. 2        C. 3       ...

Apache Pig Interview Question-Answer

Q.1 You can run Pig in batch mode using __________ .        A. Pig shell command        B. Pig Latin statements       ...

Leave a Comment