Hello Team,
We are following the Huawei Big Data training, and I would like some help with the course.
1. Can you please summarise each of the following:
HDFS - Hadoop Distributed File System
MapReduce - Distributed Off-line Batch Processing and YARN - Resource Negotiator
Spark2x - In-memory Distributed Computing Engine
HBase - Distributed NoSQL Database
Hive - Distributed Data Warehouse
Streaming - Distributed Stream Computing Engine
Flink - Stream Processing and Batch Processing Platform
Loader - Data Transformation
Flume - Massive Logs Aggregation
Kafka - Distributed Message Subscription System
Zookeeper - Cluster Distributed Coordination Service
FusionInsight HD Solution Overview
What are the features mentioned above called in Big Data? Components? Frameworks?
2. Can anyone please describe what YARN is?
3. Should we learn the architecture by heart?
https://ilearningx.huawei.com/courses/course-v1:HuaweiX+EBGTC00000246+2019.1/courseware/e439d29f7c2f4709bb3c7d61395051df/d7e8f4079d4d4ebb892e3329c736a35a/
Regards,
Roshan