Question: A company decided to perform real time analytics on its Web server logs to gain insights on website visitors, behavior, crawlers accessing the site, business

A company decided to perform real time analytics on its Web server logs to gain insights on
website visitors, behavior, crawlers accessing the site, business insights, security issues, and more.
For this, the log file entries are to be streamed to an HDFS folder in a Hadoop cluster. Streaming the
logs to HDFS is to be done only when 1000 or more entries are made to the webserver log.
Configure a Flume agent to implement the streaming with webserver log as the source and an
HDFS folder as the sink.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!