Apache Flume Interview Question-Answer

Q.1 Apache Flume 1.3.0 is the fourth release under the auspices of Apache of the so-called ________ codeline.

       A. NG

       B. ND

       C. NF

       D. NR

Ans : NG


Q.2 How can Flume be used with HBase?

       A. HBaseSink

       B. AsyncHBaseSink

       C. Both A and B

       D. None of these

Ans : Both A and B


Q.3 What is true about Apache Flume?

       A. Apache Flume is a reliable and distributed system for collecting, aggregating and moving massive quantities of log data.

       B. It has a simple yet flexible architecture based on streaming data flows

       C. Apache Flume is used to collect log data present in log files from web servers and aggregating it into HDFS for analysis.

       D. All of the above

Ans : All of the above


Q.4 List the various types of “Cluster Managers in Spark

       A. Standalone

       B. Apache Mesos

       C. YARN

       D. All of above

Ans : All of above


Q.5 Point out the correct statement.

       A. Flume is a distributed, reliable, and available service

       B. Version 1.5.2 is the eighth Flume release as an Apache top-level project

       C. Flume 1.5.2 is production-ready software for integration with hadoop

       D. All of the mentioned

Ans : Flume is a distributed, reliable, and available service


Q.6 A Flume agent is a JVM process which has

       A. 3 components

       B. 4 components

       C. 5 components

       D. 6 components

Ans : 3 components


Q.7 What are the important steps in the configuration?

       A. Every Source must have atleast one channel

       B. Every Sink must have only one channel

       C. Every Components must have a specific type

       D. All of the above

Ans : All of the above


Q.8 ___________ was created to allow you to flow data from a source into your Hadoop environment.

       A. Imphala

       B. Oozie

       C. Flume

       D. All of the mentioned

Ans : Flume


Q.9 Flume Big data has different levels of reliability to offer?

       A. best-effort delivery

       B. end-to-end delivery

       C. Both A and B

       D. None

Ans : Both A and B


Q.10 What are the different channel types in Flume?

       A. Memory Channel

       B. File Channel

       C. JDBC Channel

       D. All of these

Ans : All of these


Q.11 A ____________ is an operation on the stream that can transform the stream.

       A. Decorator

       B. Source

       C. Sinks

       D. All of the mentioned

Ans : Source


Q.12 Flume carries data between?

       A. sources and decorator

       B. sources and sinks

       C. start and decorator

       D. decorator and sinks

Ans : sources and sinks


Q.13 What are the tools Used in Big Data?

       A. Hadoop

       B. Hive

       C. Pig

       D. All of these

Ans : All of these


Q.14 Point out the wrong statement.

       A. Version 1.4.0 is the fourth Flume release as an Apache top-level project

       B. Apache Flume 1.5.2 is a security and maintenance release that disables SSLv3 on all components in Flume that support SSL/TLS

       C. Flume is backwards-compatible with previous versions of the Flume 1.x codeline

       D. None of the mentioned

Ans : None of the mentioned


Q.15 This gathering of data can be?

       A. scheduled

       B. event-driven

       C. user-defined

       D. Both A and B

Ans : Both A and B


Q.16 A number of ____________ source adapters give you the granular control to grab a specific file.

       A. multimedia file

       B. text file

       C. image file

       D. None of the above

Ans : text file


Q.17 ____________ is used when you want the sink to be the input source for another operation.

       A. Collector Tier Event

       B. Agent Tier Event

       C. Basic

       D. All of above

Ans : Agent Tier Event


Q.18 ___________ is where you would land a flow (or possibly multiple flows joined together) into an HDFS-formatted file system.

       A. Collector Tier Event

       B. Agent Tier Event

       C. Basic

       D. All of above

Ans : Collector Tier Event


Q.19 ____________ sink can be a text file, the console display, a simple HDFS path, or a null bucket where the data is simply deleted.

       A. Collector Tier Event

       B. Agent Tier Event

       C. Basic

       D. All of above

Ans : Basic


Q.20 Flume deploys as one or more agents, each contained within its own instance of _________.

       A. JVM

       B. Chunks

       C. Channels

       D. None of the above

Ans : JVM


Q.21 Flume Hadoop can also be used to transport event data including but not limited to network traffic data, data generated by social media websites and email messages.

       A. True

       B. False

Ans : True


Leave a Comment