Alan Gates describes the incubator project HCatalog, an integration tool that enables data interoperability for the Apache Hadoop* framework and external system users. This overview by an expert from the Apache Hadoop open-source community covers how HCatalog works as a table management layer for the Hadoop* framework, its ability to integrate with enterprise data management tools external to the Hadoop framework using a REST interface, statistical support, and next steps for this integration tool. Includes a description of the Apache Incubator* project process. Part of the Intel® IT Center’s Apache Hadoop Community Spotlight series. Also listen to the podcast of the interview.
Apache HDFS* overview.
Apache Pig* overview.
Apache MapReduce overview.
The Intel® Distribution for Apache Hadoop* Software
Apache Hive* overview