Syllabus
Fast Analytics on Fast Data with Apache Kudu
Duration
3 Days
Description
Apache Kudu is free, open-source software that provides a column-oriented data storage engine and is part of the Apache Hadoop ecosystem. It is compatible with most data processing frameworks in the Hadoop environment and complements the Hadoop storage layer in pursuit of the goal of "fast analytics on fast data".
Prerequisites
To take this training, participants should already have knowledge of:
· Basic Hadoop skills
· How HDFS works
· Basic Java / Python programming
Training Objectives
After this training, you are expected to have skills in:
· Kudu functionality
· Kudu architecture
· How Kudu helps in a Hadoop environment
Target Audience
· Data Analysts
· Data Scientists
· Data Architects
· Developers
Course Content:
About Apache Kudu
Concepts and Terms
Columnar Datastore
Raft Consensus Algorithm
Table
Tablet
Tablet Server
Master
Catalog Table
Logical Replication
Architectural Overview
Example Use Cases
Apache Kudu (incubating) Release Notes
Introducing Apache Kudu (incubating)
About the Kudu Public Beta
Resources
Installation Options
Kudu Release Notes
New Features in Kudu
Other Improvements and Changes in Kudu
Issues Fixed in Kudu
Incompatible Changes in Kudu
Limitations of Kudu
Upgrade Notes for Kudu
Kudu Installation Requirements
Install Kudu Using the Command Line
Installing and Using Apache Impala (incubating) With Apache Kudu
Installing Impala_Kudu Using Cloudera Manager
Installing Impala_Kudu Parcels Using the deploy.py Script
Installing Impala_Kudu Parcels Manually
Installing Impala_Kudu Packages
Installing Impala_Kudu Using the Command Line
Internal and External Impala Tables
Querying an Existing Kudu Table In Impala
Creating a New Kudu Table From Impala
Understanding SQL Operators and Kudu
Failures During INSERT, UPDATE, and DELETE Operations
Example Apache Impala Commands With Kudu
Integration with MapReduce, YARN, and Other Frameworks
Issues Starting or Restarting the Tablet Server
Error during hole punch test
Clock is not synchronized
deploy.py script exits with create: error: too few arguments
Troubleshooting Performance Issues