
曙海教學(xué)優(yōu)勢
課程可定制,線上/線下/上門皆可,報(bào)名熱線:4008699035。本課程以項(xiàng)目實(shí)戰(zhàn)案例實(shí)現(xiàn)為主線,面向企事業(yè)單位項(xiàng)目開發(fā)實(shí)際,秉承21年積累的教學(xué)和研發(fā)經(jīng)驗(yàn),培訓(xùn)講師將會與您分享設(shè)計(jì)的全流程以及工具的綜合使用經(jīng)驗(yàn)以及技巧。
  我們的課程培養(yǎng)了大批受企業(yè)歡迎的工程師。曙海培訓(xùn)的課程在業(yè)內(nèi)有廣泛的美譽(yù)度。大批企業(yè)和曙海
     建立了良好的合作關(guān)系,20多年來,合作企事業(yè)單位以達(dá)30多萬。
?培訓(xùn)對象:需要使用Hadoop來進(jìn)行數(shù)據(jù)分析的數(shù)據(jù)分析員,商業(yè)分析
教學(xué)大綱:
Hadoop基礎(chǔ)
Pig基礎(chǔ)
使用Pig進(jìn)行簡單數(shù)據(jù)分析
使用Pig處理復(fù)雜數(shù)據(jù)
使用Pig分析處理多數(shù)據(jù)集
Pig排錯和優(yōu)化
Hive與Impala基礎(chǔ)
使用Hive與Impala進(jìn)行數(shù)據(jù)分析
數(shù)據(jù)管理
數(shù)據(jù)存儲與性能
使用Hive與Impala進(jìn)行數(shù)據(jù)分析
Impala如何執(zhí)行查詢/擴(kuò)展及改善性能
使用Hive分析處理文本數(shù)據(jù)
Hive優(yōu)化
擴(kuò)展Hive
如何選取數(shù)據(jù)分析工具
?
課程大綱:
Hadoop?Fundamentals?
?
??????Hadoop?Overview?
?
??????Data?Storage:?HDFS?
?
??????Distributed?Data?Processing:?YARN,?MapReduce,?and?Spark?
?
??????Data?Processing?and?Analysis:?Pig,?Hive,?and?Impala?
?
??????Data?Integration:?Sqoop?
?
??????Other?Hadoop?Data?Tools?
?
??????Exercise?Scenarios?Explanation?
?
?
?
Introduction?to?Pig?
?
??????What?Is?Pig??
?
??????Pig’s?Features?
?
??????Pig?Use?Cases?
?
??????Interacting?with?Pig?
?
Basic?Data?Analysis?with?Pig?
?
??????Pig?Latin?Syntax?
?
??????Loading?Data?
?
??????Simple?Data?Types?
?
??????Field?Definitions?
?
??????Data?Output?
?
??????Viewing?the?Schema?
?
??????Filtering?and?Sorting?Data?
?
??????Commonly-Used?Functions?
?
Processing?Complex?Data?with?Pig?
?
??????S?torage?Formats?
?
??????Complex/Nested?Data?Types?
?
??????G?rouping?
?
??????Built-In?Functions?for?Complex?Data?
?
??????Iterating?Grouped?Data?
?
Multi-Dataset?Operations?with?Pig?
?
??????Techniques?for?Combining?Data?Sets?
?
??????Joining?Data?Sets?in?Pig?
?
??????Set?Operations?
?
??????Splitting?Data?Sets?
?
Pig?Troubleshooting?and?Optimization?
?
??????Troubleshooting?Pig?
?
??????Logging?
?
??????Using?Hadoop’s?Web?UI?
?
??????Data?Sampling?and?Debugging?
?
??????Performance?Overview?