This Specialization by Johns Hopkins University is designed for data scientists familiar with R, aiming to leverage the Tidyverse for data science. Through 5 courses, participants will master importing, wrangling, visualizing, and modeling data using the powerful Tidyverse framework.
What You'll Learn:
Certificate Available ✔
Get Started / More InfoThe Tidyverse Skills for Data Science in R course modules cover introduction to the Tidyverse, importing data, data wrangling, data visualization, and data modeling with a focus on practical application.
Participants will learn to distinguish between tidy and non-tidy data and describe the Tidyverse ecosystem of packages. They will also gain the skills to organize and initialize a data science project.
Students will be able to describe different data formats, apply Tidyverse functions to import data into R from external formats, and obtain data from a web API.
Participants will apply Tidyverse functions to transform non-tidy data to tidy data, conduct basic exploratory data analysis, and analyze text data.
Students will distinguish between various types of plots and their uses, use the ggplot2 R package to develop data visualizations, build effective data summary tables, and create data animations for visual storytelling.
Participants will describe different types of data analytic questions, conduct hypothesis tests of data, apply linear modeling techniques to answer multivariable questions, and apply machine learning workflows to detect complex patterns in data.
Business Data Management and Communication is a comprehensive specialization covering advanced accounting, big data analysis, and effective communication of valuable...
Data Mining Project offers hands-on experience in designing and implementing real-world data mining projects, providing step-by-step guidance from problem formulation...
Introduction to Microsoft Azure Synapse Analytics provides a comprehensive understanding of designing and implementing data solutions using Azure Synapse Analytics...
Serverless Data Processing with Dataflow: Develop Pipelines is an in-depth course covering Apache Beam SDK, streaming data processing, stateful transformations,...