Course

Use the Apache Spark Structured Streaming API with MongoDB

Coursera Project Network

Learn to use the Apache Spark Structured Streaming API with Python to stream data from two different sources. You will also store a dataset in a MongoDB database and join two datasets with Apache Spark.

Throughout this guided project, you will gain hands-on experience using the Structured Streaming API to continuously capture data from sources such as the file system or TCP/IP sockets. You will build practical skills in processing and analyzing streaming data, with a specific focus on capturing data from weather stations and keeping it as a historical record.

Key Learning Objectives:

  • Use the Apache Spark Structured Streaming API with Python to stream data from two different sources
  • Store a dataset in a MongoDB database using the Apache Spark Structured Streaming API
  • Perform dataset joins with Apache Spark Structured Streaming
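
To give a feel for how the three objectives above fit together, here is a minimal sketch in Python. It is not taken from the course materials: the host, port, directory, column names, database, and collection are hypothetical placeholders, and the MongoDB option keys assume the MongoDB Spark Connector 10.x is on Spark's classpath. The course itself may write to MongoDB in a different way.

```python
# Sketch only: every host, port, path, and column name below is a
# hypothetical placeholder, not a value from the course.

def mongo_sink_options(uri, database, collection, checkpoint_dir):
    """Collect writer options; the keys assume MongoDB Spark Connector 10.x."""
    return {
        "spark.mongodb.connection.uri": uri,
        "spark.mongodb.database": database,
        "spark.mongodb.collection": collection,
        "checkpointLocation": checkpoint_dir,
    }


def build_query(spark, sink_options):
    """Join a socket stream with a file stream and write the result to MongoDB."""
    # Imported here so the helper above can be used without PySpark installed.
    from pyspark.sql.functions import col, split
    from pyspark.sql.types import StringType, StructField, StructType

    # Source 1: text lines from a TCP socket, assumed to look like "ST01,21.5".
    lines = (spark.readStream.format("socket")
             .option("host", "localhost")
             .option("port", 9999)
             .load())
    parts = split(col("value"), ",")
    readings = lines.select(
        parts.getItem(0).alias("station_id"),
        parts.getItem(1).cast("double").alias("temperature_c"),
    )

    # Source 2: CSV files dropped into a directory (file sources need a schema).
    station_schema = StructType([
        StructField("station_id", StringType()),
        StructField("station_name", StringType()),
    ])
    stations = spark.readStream.schema(station_schema).csv("data/stations/")

    # Stream-stream inner join on the shared station identifier.
    joined = readings.join(stations, on="station_id", how="inner")

    # Append each joined row to the MongoDB collection named in sink_options.
    return (joined.writeStream.format("mongodb")
            .options(**sink_options)
            .outputMode("append")
            .start())


def main():
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("weather-stream-sketch").getOrCreate()
    options = mongo_sink_options(
        "mongodb://localhost:27017", "weather", "readings", "/tmp/weather-checkpoint"
    )
    build_query(spark, options).awaitTermination()
```

Calling main() starts the query and blocks until it is stopped. It assumes something is sending lines to localhost:9999 and that a MongoDB instance is reachable at the given URI. This join keeps unbounded state; a longer-running job would normally add watermarks on event-time columns.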

Certificate Available ✔

More Data Management Courses

Advanced MySQL Topics

Meta

Advanced MySQL Topics is a comprehensive course covering advanced database engineering skills in MySQL, including optimization techniques, data analytics, and the...

Creating Database Tables with SQL

Coursera Project Network

Creating Database Tables with SQL is a comprehensive course that guides learners through the process of defining, creating, and managing relational database tables...

Introduction to Regular Expressions in SQL

Coursera Project Network

Introduction to Regular Expressions in SQL is a project-based course where you will learn to use POSIX regular expressions for extensive pattern matching in SQL...

Prep for Microsoft Azure Data Engineer Associate Cert DP-203

SkillUp EdTech

Prep for Microsoft Azure Data Engineer Associate Cert DP-203 equips you to excel in designing, processing, securing, and optimizing data storage with Azure Databricks...