What Is Hadoop?

What Is Hadoop?

The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. Hadoop includes these subprojects:

    * Hadoop Common: The common utilities that support the other Hadoop subprojects.
    * Chukwa: A data collection system for managing large distributed systems.
    * HBase: A scalable, distributed database that supports structured data storage for large tables.
    * HDFS: A distributed file system that provides high throughput access to application data.
    * Hive: A data warehouse infrastructure that provides data summarization and ad hoc querying.
    * MapReduce: A software framework for distributed processing of large data sets on compute clusters.
    * Pig: A high-level data-flow language and execution framework for parallel computation.
    * ZooKeeper: A high-performance coordination service for distributed applications.

Hadoop的是什么?

Apache的Hadoop项目的发展提供可靠,可扩展,分布式计算开放源码软件。 Hadoop的包括以下子项目:

    
* Hadoop的共同点:即支持其他Hadoop的子项目的共同事业。
    
* Chukwa:一个管理大型分布式系统的数据收集系统。
    
* HBase:一种可扩展,分布式数据库,支持结构化数据大表存储。
    
* HDFS:一个分布式的文件系统,提供了高吞吐量的应用程序数据访问。
    
*
Hive:一种数据仓库基础设施,提供数据汇总和特设查询。
    
* MapReduce的:一个大型的分布式数据处理软件框架集的计算集群。
    
*
Pig:一种高层次的数据流语言和并行计算的执行框架。
    
*
ZooKeeper:一个分布式应用的高性能的协调服务。

你可能感兴趣的:(FY_NoSQL)