Course

Data Science at Scale

University of Washington

Explore intermediate topics in data science with the University of Washington's Specialization in Data Science at Scale. Gain hands-on experience with scalable SQL and NoSQL data management solutions, data mining algorithms, and practical statistical and machine learning concepts.

Designed to equip learners with the skills to visualize data, communicate results, and address legal and ethical issues related to big data, this Specialization culminates in a real-world Capstone Project in partnership with Coursolve, a digital internship platform.

  • Learn the landscape of relevant systems and principles for scalable data analytics platforms.
  • Design statistical experiments and apply modern methods for predictive analytics.
  • Master the art of communicating data science results and explore ethical considerations surrounding big data.
  • Engage in a real-world Capstone Project involving the entire data science pipeline.

Certificate Available ✔

Get Started / More Info
Data Science at Scale
Course Modules

This Specialization covers scalable data manipulation, predictive analytics, effective communication of data science results, and a real-world Capstone Project in partnership with Coursolve.

Data Manipulation at Scale: Systems and Algorithms

Data analysis is the cornerstone of evidence-based decision making, and this module focuses on scalable data manipulation systems and algorithms. Gain insight into practical systems derived from the frontier of research in computer science and learn to "think" in MapReduce for effective algorithm writing. Explore the landscape of specialized Big Data systems for graphs, arrays, and streams.

Practical Predictive Analytics: Models and Methods

Statistical experiment design and analytics are central to data science. This module covers the design of statistical experiments, resampling methods, and a core set of practical machine learning methods and concepts. Additionally, it delves into the common idioms of large-scale graph analytics.

Communicating Data Science Results

This module emphasizes the importance of effective visualization, ethical considerations around big data, and the use of cloud computing for reproducible data analysis. Gain insight into the state-of-the-art in privacy, ethics, and governance related to big data and data science.

Data Science at Scale - Capstone Project

In the Capstone Project, students engage in a real-world project requiring them to apply skills from the entire data science pipeline. Through a collaboration with Coursolve, students work on projects associated with partner stakeholders, gaining practical experience in data science projects.

More Data Analysis Courses

3D SARS-CoV-19 Protein Visualization With Biopython

Coursera Project Network

Create an interactive 3D representation of SARS-CoV-19 protein using Biopython in this hands-on project, providing insights into bioinformatics and medical research....

Datastore: Qwik Start

Google Cloud

Learn to store and query data in Google Cloud Datastore using the Google Cloud Platform.

Linear Regression for Business Statistics

Rice University

Linear Regression for Business Statistics teaches you to apply various procedures in Microsoft Excel. Learn to build, estimate, and interpret linear regression models...

Support Vector Machines in Python, From Start to Finish

Coursera Project Network

Learn to build a Support Vector Machine for classification using scikit-learn and the Radial Basis Function (RBF) Kernel.