Hortonworks Hadoop 开发认证培训
课程介绍:
Hortonworks Apache Hadoop 开发认证培训课程是为设计、架构和开发Hadoop解决方案的开发人员,以及进行Hadoop项目规划的咨询师设计。拥有此认证可被认为具有Apache Hadoop开发的高级技能。
Developing Applications with the Hortonworks Data Platform using Java(Apache Hadoop 2.0)
课程对象:
适用于有Java经验的工程师想要了解和开发基于Java的Hadoop 2.0 MapReduce应用程序
课程长度:4天
认证考试:Hortonworks Apache Hadoop 开发认证考试费用(HCAHD)
最新时间:定制课程(内训),人满开班(公开课)
课程大纲:
Lab 1.1:Configuring a Hadoop 2.0 Development Environment
Lab1.2:Putting Files in HDFS with Java
单元2:编写MapReduce程序
Lab2.1:Word Count
Lab2.2:Distributed Grep
Lab2.3:Inverted Index
单元3:Map端聚合
Lab3.1:Using a Combiner
Lab3.2:Computing an Average
单元4:分区和排序
Lab4.1:Writing a Custom Partitioner
Lab4.2:Using Total Order Partitioner
Lab4.3:Custom Sorting
单元5:输入和输出格式
Lab5.1:Writing a Custom Input Format
Lab5.2:Customizing Output
Lab5.3:Simple Moving Average
单元6:优化 MapReduce任务
Lab6.1:Using Data Compression
Lab6.2:Defining a Raw Comparator
单元7:MapReduce特性
Lab7.1:Performing a Map--‐Side Join
Lab7.2:Using a Bloom Filter
单元8:MapReduce单元测试
Lab8.1:Unit Testing a MapReduce Job
单元9:HBase编程
Lab9.1:Importing Data into HBase
Lab9.2:An HBase MapReduce Job
单元10:Pig编程
Lab10.1:Writing a Pig UDF
Lab10.2:Writing an Accumulator UDF
单元11:Hive编程
Lab 11.1:Writing a Hive UDF
附录A:定义Oozie工作流
Oozie Lab:Defining an Oozie Workflow
Workflow Lab:TF--‐IDF and JobControl