APACHE STORM INTERVIEW QUESTIONS AND ANSWERS
Big Data Technology
1. Q : WHAT IS APACHE STORM ?
ANS : Apache Storm is a distributed, real-time big data processing system used for stream processing. It is designed to process vast amounts of data in a fault-tolerant and horizontally scalable way. Storm itself is stateless; it manages the distributed environment and cluster state through Apache ZooKeeper. It is simple to use, and it lets you run all kinds of manipulations on real-time data in parallel. When its acking (reliability) mechanism is used, Storm guarantees that every message will be processed through the topology at least once.
2. Q : WHAT ARE THE CORE COMPONENTS OF APACHE STORM ?
- 1. Tuple
- 2. Stream
- 3. Spouts
- 4. Bolts
1. Tuple :
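Tuple is the main data structure in Storm. It is an ordered list of elements, and each element can be read by position or by field name. By default a tuple can hold primitive types, strings, and byte arrays; other types need a registered serializer. It is modelled as a set of comma-separated values and passed to a Storm cluster.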
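To make the "ordered list with named fields" idea concrete, here is a minimal sketch in plain Java. It does not use the Storm library: `SketchTuple` and `TupleDemo` are hypothetical names invented for illustration. Only the accessor names (`getValue`, `getValueByField`, `size`) are chosen to mirror methods on Storm's `Tuple` interface.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for a Storm tuple (not the real Storm class):
// an ordered list of values whose positions are named by a parallel
// list of field names.
final class SketchTuple {
    private final List<String> fields;
    private final List<Object> values;

    SketchTuple(List<String> fields, List<Object> values) {
        if (fields.size() != values.size()) {
            throw new IllegalArgumentException("fields and values must have the same length");
        }
        // Defensive copies keep the tuple immutable after construction.
        this.fields = Collections.unmodifiableList(new ArrayList<>(fields));
        this.values = Collections.unmodifiableList(new ArrayList<>(values));
    }

    // Positional access: order matters, so index 0 is always the first value.
    Object getValue(int index) {
        return values.get(index);
    }

    // Named access: look up the field's position, then read that position.
    Object getValueByField(String field) {
        int index = fields.indexOf(field);
        if (index < 0) {
            throw new IllegalArgumentException("unknown field: " + field);
        }
        return values.get(index);
    }

    int size() {
        return values.size();
    }
}

class TupleDemo {
    public static void main(String[] args) {
        SketchTuple t = new SketchTuple(
                Arrays.asList("word", "count"),
                Arrays.<Object>asList("storm", 3));
        System.out.println(t.getValue(0));              // prints storm
        System.out.println(t.getValueByField("count")); // prints 3
    }
}
```

In a real topology you would not build tuples like this yourself; components declare their field names and emit lists of values, and Storm constructs the tuples.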
2. Stream :
Stream is an unbounded sequence of tuples.
3. Spouts :
A spout is the source of a stream. Generally, Storm accepts input data from raw data sources such as the Twitter Streaming API or an Apache Kafka queue. Otherwise, you can write your own spouts to read data from other data sources. “ISpout” is the core interface for implementing spouts. More specific types include the IRichSpout interface and the BaseRichSpout and KafkaSpout classes.
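The spout contract is easier to follow with a small model. The sketch below is plain Java with no Storm dependency: `SketchSpout` and `SpoutDemo` are hypothetical names, and the in-memory queue stands in for a real source such as Kafka. It borrows only the method names `nextTuple`, `ack`, and `fail` from `ISpout` to show how replaying failed messages can give at-least-once delivery. The real interface has different signatures, for example emitting through a collector.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Queue;

// Hypothetical, dependency-free model of a reliable spout (not Storm's API).
final class SketchSpout {
    private final Queue<String> source = new ArrayDeque<>();      // stand-in for an external source
    private final Map<Integer, String> pending = new HashMap<>(); // emitted but not yet acked
    private int nextId = 0;

    SketchSpout(List<String> messages) {
        source.addAll(messages);
    }

    // Emits the next message into `out` and returns its message id,
    // or -1 when the source currently has nothing to emit.
    int nextTuple(List<String> out) {
        String msg = source.poll();
        if (msg == null) {
            return -1;
        }
        int id = nextId++;
        pending.put(id, msg); // remember it until downstream processing is confirmed
        out.add(msg);
        return id;
    }

    // Processing succeeded: the message can be forgotten.
    void ack(int id) {
        pending.remove(id);
    }

    // Processing failed: put the message back so it is emitted again.
    void fail(int id) {
        String msg = pending.remove(id);
        if (msg != null) {
            source.add(msg);
        }
    }

    int pendingCount() {
        return pending.size();
    }
}

class SpoutDemo {
    public static void main(String[] args) {
        List<String> emitted = new ArrayList<>();
        SketchSpout spout = new SketchSpout(Arrays.asList("a", "b"));
        int first = spout.nextTuple(emitted);
        int second = spout.nextTuple(emitted);
        spout.ack(first);   // "a" was fully processed
        spout.fail(second); // "b" failed and is queued for replay
        spout.nextTuple(emitted);
        System.out.println(emitted); // prints [a, b, b]
    }
}
```

Because "b" is emitted twice, downstream logic sees it at least once but possibly more than once, which is why at-least-once processing usually calls for idempotent handling.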