Course

Data Engineering in AWS

Whizlabs

Data Engineering in AWS is a foundational course in the AWS Certified Machine Learning Specialty specialization. Throughout the course, learners will gain insights into various data gathering techniques and how to analyze and handle missing data effectively. With a focus on hands-on learning, the course delves into feature extraction and selection using Principal Component Analysis and Variance Thresholds.

The course is divided into two modules. Module 1, "Introduction to Data Engineering," covers topics such as setting up an Amazon Sagemaker environment, data gathering techniques, handling missing data, and an overview of data engineering. Module 2, "Feature extraction and feature selection," explores techniques such as encoding categorical values, numerical engineering, and text feature editing. Learners will also gain practical knowledge through a project involving performing ETL operations in Glue with S3.

Certificate Available ✔

Get Started / More Info
Data Engineering in AWS
Course Modules

Data Engineering in AWS consists of two modules. Module 1 focuses on introducing data engineering, while Module 2 covers feature extraction and feature selection, including a project involving ETL operations in Glue with S3.

Introduction to Data Engineering

Welcome to the AWS Machine Learning Specialty Certification Exam course. This module provides an overview of the exam, the goals of the course, and machine learning terminology. It also covers setting up an Amazon Sagemaker environment, data gathering techniques, and handling missing data using various imputation techniques.

Feature extraction and feature selection

This module delves into feature extraction and selection, including Principal Component Analysis, Variance Thresholds, encoding categorical values, numerical engineering, and text feature editing. Learners will also receive exam tips and participate in assessments to reinforce their understanding of the course material.

More Machine Learning Courses

Machine Learning: Theory and Hands-on Practice with Python

University of Colorado Boulder

Machine Learning: Theory and Hands-on Practice with Python is a comprehensive specialization covering supervised and unsupervised learning as well as introductory...

Introducción al procesamiento de lenguaje natural

Universidad Austral

Introducción al procesamiento de lenguaje natural es un curso introductorio que te capacitará en NLP, clasificación de texto, análisis de sentimiento y más,...

Probabilistic Graphical Models 1: Representation

Stanford University

Probabilistic Graphical Models 1: Representation provides a foundational understanding of Bayesian Networks and Markov networks, essential for various fields including...

Prompt Engineering

Vanderbilt University

Unleash the power of Generative AI with Prompt Engineering to automate tasks, increase productivity, and augment human intelligence.