757-216-3656 | Monday–Friday 8:30 AM – 4:30 PM | info@itdojo.com

Course Duration

1 Day

Audience

Employees of federal, state and local governments; and businesses working with the government.

Prerequisites

Students with a minimum one year of experience managing open-source data frameworks such as Apache Spark or Apache Hadoop will benefit from this course.

Course Description

This course focuses on designing and deploying batch data analytics pipelines on AWS. Students learn to use services such as AWS Glue, Amazon EMR, Amazon S3, and AWS Lake Formation to ingest, catalog, transform, and analyze large datasets at scale.

Learning Objectives

  • Compare the features and benefits of data warehouses, data lakes, and modern data architectures
  • Design and implement a batch data analytics solution
  • Identify and apply appropriate techniques, including compression, to optimize data storage
  • Select and deploy appropriate options to ingest, transform, and store data
  • Choose the appropriate instance and node types, clusters, auto scaling, and network topology for a particular business use case
  • Understand how data storage and processing affect the analysis and visualization mechanisms needed to gain actionable business insights
  • Secure data at rest and in transit
  • Monitor analytics workloads to identify and remediate problems
  • Apply cost management best practices

Course Outline

  • Module A: Overview of Data Analytics and the Data Pipeline
  • Module 1: Introduction to Amazon EMR
  • Module 2: Data Analytics Pipeline Using Amazon EMR: Ingestion and Storage
  • Module 3: High-Performance Batch Data Analytics Using Apache Spark on Amazon EMR
  • Module 4: Processing and Analyzing Batch Data with Amazon EMR and Apache Hive
  • Module 5: Serverless Data Processing
  • Module 6: Security and Monitoring of Amazon EMR Clusters
  • Module 7: Designing Batch Data Analytics Solutions
  • Module B: Developing Modern Data Architectures on AWS

Frequently Asked Questions

What does the Building Batch Data Analytics Solutions on AWS course cover?

This course covers Building Batch Data Analytics Solutions on AWS training and best practices. IT Dojo delivers it as live instructor-led training with an emphasis on practical skills for government and DoD professionals.

How long is IT Dojo's Building Batch Data Analytics Solutions on AWS training?

IT Dojo's Building Batch Data Analytics Solutions on AWS training is 1 Day. It is available as live remote online instruction or on-site at your facility. All sessions are instructor-led with small class sizes to ensure individual attention.

Is this course available as live remote online training?

Yes. IT Dojo offers Building Batch Data Analytics Solutions on AWS as live remote online training. A certified instructor leads the session in real time — students interact via chat or microphone. Classes are kept small (typically no more than 16 students) to ensure engagement. On-site delivery at your government facility or contractor location is also available.

What prerequisites are recommended before this course?

Students with a minimum one year of experience managing open-source data frameworks such as Apache Spark or Apache Hadoop will benefit from this course.

Does IT Dojo offer this training on-site at government or DoD facilities?

Yes. IT Dojo delivers Building Batch Data Analytics Solutions on AWS on-site at government agencies, DoD commands, military installations, and contractor facilities. On-site training is ideal for teams of four or more and can be customized to your organization's specific environment and mission requirements. Contact IT Dojo to schedule.

How do I register for this course?

IT Dojo training is employer-sponsored — your organization registers and pays for seats. To schedule Building Batch Data Analytics Solutions on AWS for your team, contact IT Dojo via the Request Training form or call 757-216-3656. IT Dojo will work with your contracting officer, training coordinator, or program office to set up the course.

Get More Information

We cannot work with the general public. We only work with Government Agencies, Military, government contractors, and corporate clients.