This course starts at the foundational level with Big Data technical essentials. It covers the foundations of Hadoop and Big Data technology and is intended for anyone who wants to get started with these technologies.

Course Schedule
This is a weekday course that will be held July 16 - August 8, 2019 (US Pacific Time).
Class sessions will be held every Monday and Wednesday, 6:30-8:30 PM US Pacific Time.
Please check your local date and time for the first session.

Prerequisite
Desired but not required: exposure to, or working proficiency in, BI, SQL, scripting, handling and managing data and databases, Excel, and the Java programming language.

Course Features
4 weeks, 8 sessions, 16 hours of total LIVE instruction
Training material, instructor handouts, and access to useful resources on the cloud provided
Practical hands-on lab exercises on cloud workstations provided
Actual code and scripts provided
Real-life scenarios

Course Outline

Session 1: Big Data Basics
An introduction to Big Data
Why Big Data? Why now?
The Three Dimensions of Big Data (Three Vs)
Evolution of Big Data
Big Data versus Traditional RDBMS Databases
Big Data versus Traditional BI and Analytics
Big Data versus Traditional Storage
Key Challenges in Big Data Adoption
Benefits of Adopting Big Data
Introduction to the Big Data Technology Stack
Apache Hadoop Framework
Introduction to Microsoft HDInsight – Microsoft’s Big Data Service
Hands-On Lab:
Creating an Azure Storage Account
Creating an HDInsight Cluster
Using services on an HDInsight Cluster

Session 2: The Big Data Technology Stack
Basics of Hadoop Distributed File System (HDFS)
Basics of Hadoop Distributed Processing (MapReduce Jobs)
Hands-On Lab:
Loading files to an Azure Storage Account
Moving files across an HDInsight Cluster
Remote access to an Azure Storage Account and HDInsight Cluster

Session 3: Deep Dive into Hadoop Storage System (HDFS) (1 Hour)
HDFS
Reading files with HDFS
Writing files with HDFS
Error Handling
Hands-On Lab:
Accessing Hadoop configuration files using an HDInsight Cluster

Session 4: Processing Big Data – MapReduce and YARN
How MapReduce works
Handling Common Errors
Bottlenecks with MapReduce
How YARN (MapReduceV2) works
Difference between MR1 and MR2
Error Handling
Hands-On Lab:
Running a simple MapReduce application (word count)
Running a custom MapReduce application (census data)
Running MapReduce via PowerShell
Running a MapReduce application using PowerShell
Monitoring application status

Session 5: Big Data Development Framework
Introduction to Hive
Introduction to Pig
HBase
Hands-On Lab:
Loading data into Hive
Submitting Pig jobs using HDInsight
Submitting Pig jobs via PowerShell

Session 6: Big Data Integration and Management
Big Data Integration using PolyBase
Big Data Management using Ambari
Hands-On Lab:
Fetching HDInsight data into SQL
Using Ambari to manage an HDInsight Cluster

Session 7: MapReduce

Session 8: Big Data Analytics

Refund Policy
A 100% refund is available if the request is initiated 24 hours before the first course session.
If a class is rescheduled/cancelled by the organizer, registered students will be offered a credit towards any future course or a 100% refund.