Objectives of the training
In this course, you'll discover how to use Amazon EMR to process data through the Hadoop ecosystem. You will also learn how to create Big Data environments; how to use Amazon DynamoDB, Amazon Redshift, Amazon Kinesis, Amazon QuickSight and Amazon Athena; and how to apply best practices to design secure and cost-effective Big Data environments.
Target audience
People in charge of designing and implementing Big Data solutions, such as solution architects, as well as data analysts who want to explore Big Data solutions on AWS.
Prerequisites
• Basic knowledge of Big Data technologies, including Apache Hadoop, HDFS, Pig, Hive and MapReduce.
• Working knowledge of the main AWS services and of public cloud implementations.
• Participants must have taken the "AWS Basics" course or have an equivalent level of experience.
• Understanding of the concepts of data warehousing, relational database systems and database design is recommended.
Benefits for Participants
Understand Apache Hadoop applications in the context of Amazon EMR
Identify the components of an Amazon EMR cluster
Launch and configure an Amazon EMR cluster
Use the common programming frameworks available for Amazon EMR, including Hive, Pig and Streaming
Use Hue to enhance the usability of Amazon EMR
Use in-memory analysis with Spark on Amazon EMR
Identify the benefits of using Amazon Kinesis for near real-time Big Data processes
Use Amazon Redshift to efficiently store and analyze data
Understand and manage the costs and security of a Big Data solution
Secure a Big Data solution
Identify options for retrieving, transferring and compressing data
Understand Amazon Athena for ad hoc query analysis
Use visualization software to represent data and queries via Amazon QuickSight
Orchestrate the flow of Big Data via AWS Data Pipeline
Course architecture
Introduction to Big Data on AWS
Overview of Big Data
Retrieving and Transferring Big Data
Streaming Big Data and Amazon Kinesis
Big Data Storage Solutions
Big Data Processing and Analysis
The Hadoop Ecosystem
Apache Hadoop and Amazon EMR
Using Amazon EMR
Hadoop Programming Frameworks
Web Interfaces on Amazon EMR
Apache Spark on Amazon EMR
Big Data and AWS
Amazon Redshift and Big Data
Visualizing and Orchestrating Big Data
Managing Big Data Costs
Securing Your Amazon Deployments
Big Data Design Patterns
Pedagogical details
Type of training
Private or personalized training
If you have more than 8 people to sign up for a particular course, it can be delivered as a private session right at your offices. Contact us for more details.
Request a quote