Q.1 Apache Flume 1.3.0 is the fourth release under the auspices of Apache of the so-called ________ codeline.
A. NG
B. ND
C. NF
D. NR
Ans : NG
Q.2 How can Flume be used with HBase?
A. HBaseSink
B. AsyncHBaseSink
C. Both A and B
D. None of these
Ans : Both A and B
Q.3 What is true about Apache Flume?
A. Apache Flume is a reliable and distributed system for collecting, aggregating and moving massive quantities of log data.
B. It has a simple yet flexible architecture based on streaming data flows
C. Apache Flume is used to collect log data present in log files from web servers and aggregating it into HDFS for analysis.
D. All of the above
Ans : All of the above
Q.4 List the various types of “Cluster Managers in Spark
A. Standalone
B. Apache Mesos
C. YARN
D. All of above
Ans : All of above
Q.5 Point out the correct statement.
A. Flume is a distributed, reliable, and available service
B. Version 1.5.2 is the eighth Flume release as an Apache top-level project
C. Flume 1.5.2 is production-ready software for integration with hadoop
D. All of the mentioned
Ans : Flume is a distributed, reliable, and available service
Q.6 A Flume agent is a JVM process which has
A. 3 components
B. 4 components
C. 5 components
D. 6 components
Ans : 3 components
Q.7 What are the important steps in the configuration?
A. Every Source must have atleast one channel
B. Every Sink must have only one channel
C. Every Components must have a specific type
D. All of the above
Ans : All of the above
Q.8 ___________ was created to allow you to flow data from a source into your Hadoop environment.
A. Imphala
B. Oozie
C. Flume
D. All of the mentioned
Ans : Flume
Q.9 Flume Big data has different levels of reliability to offer?
A. best-effort delivery
B. end-to-end delivery
C. Both A and B
D. None
Ans : Both A and B
Q.10 What are the different channel types in Flume?
A. Memory Channel
B. File Channel
C. JDBC Channel
D. All of these
Ans : All of these
Q.11 A ____________ is an operation on the stream that can transform the stream.
A. Decorator
B. Source
C. Sinks
D. All of the mentioned
Ans : Source
Q.12 Flume carries data between?
A. sources and decorator
B. sources and sinks
C. start and decorator
D. decorator and sinks
Ans : sources and sinks
Q.13 What are the tools Used in Big Data?
A. Hadoop
B. Hive
C. Pig
D. All of these
Ans : All of these
Q.14 Point out the wrong statement.
A. Version 1.4.0 is the fourth Flume release as an Apache top-level project
B. Apache Flume 1.5.2 is a security and maintenance release that disables SSLv3 on all components in Flume that support SSL/TLS
C. Flume is backwards-compatible with previous versions of the Flume 1.x codeline
D. None of the mentioned
Ans : None of the mentioned
Q.15 This gathering of data can be?
A. scheduled
B. event-driven
C. user-defined
D. Both A and B
Ans : Both A and B
Q.16 A number of ____________ source adapters give you the granular control to grab a specific file.
A. multimedia file
B. text file
C. image file
D. None of the above
Ans : text file
Q.17 ____________ is used when you want the sink to be the input source for another operation.
A. Collector Tier Event
B. Agent Tier Event
C. Basic
D. All of above
Ans : Agent Tier Event
Q.18 ___________ is where you would land a flow (or possibly multiple flows joined together) into an HDFS-formatted file system.
A. Collector Tier Event
B. Agent Tier Event
C. Basic
D. All of above
Ans : Collector Tier Event
Q.19 ____________ sink can be a text file, the console display, a simple HDFS path, or a null bucket where the data is simply deleted.
A. Collector Tier Event
B. Agent Tier Event
C. Basic
D. All of above
Ans : Basic
Q.20 Flume deploys as one or more agents, each contained within its own instance of _________.
A. JVM
B. Chunks
C. Channels
D. None of the above
Ans : JVM
Q.21 Flume Hadoop can also be used to transport event data including but not limited to network traffic data, data generated by social media websites and email messages.
A. True
B. False
Ans : True