- UID
- 3955
- 帖子
- 8453
- 积分
- 26469
- 阅读权限
- 80
- 注册时间
- 2005-8-19
- 最后登录
- 2013-11-17
- 在线时间
- 5148 小时
|
hadoop是谁什么,谁来扫盲
琴瑟琵琶 发表于 2010-4-16 21:45 ![]()
The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. Hadoop includes these subprojects:
Hadoop Common: The common utilities that support the other Hadoop subprojects.
Avro: A data serialization system that provides dynamic integration with scripting languages.
Chukwa: A data collection system for managing large distributed systems.
HBase: A scalable, distributed database that supports structured data storage for large tables.
HDFS: A distributed file system that provides high throughput access to application data.
Hive: A data warehouse infrastructure that provides data summarization and ad hoc querying.
MapReduce: A software framework for distributed processing of large data sets on compute clusters.
Pig: A high-level data-flow language and execution framework for parallel computation.
ZooKeeper: A high-performance coordination service for distributed applications. |
|