Hello, friend!
The Hadoop big data platform can be regarded as a next-generation data warehousing system. It has the characteristics of modern data warehouses and is widely used by enterprises. Because of the scalability of MPP, the MPP-based data warehousing system is sometimes classified as a big data platform.
However, data warehouses greatly differ from the Hadoop platform in function and user experience in different scenarios. For details, see the following table.
Feature | Hadoop | Data Warehouse |
---|
Number of compute nodes | 1000s | Max 256 |
Data volume | Over 10 PB | Max 10 PB |
Data type | Relational, semi-relational, unstructured (voice, images, and video) | Relational only |
Latency | Medium to high | Low |
Application ecosystem | Innovative/AI | Traditional/BI |
Application development API | SQL and other programming language APIs, such as MapReduce | Standard database SQL |
Scalability | Unlimited, with comprehensive programming APIs | Limited, supported by UDFs |
Transaction support | Limited | Comprehensive |
Hope this helps!