Explore intermediate topics in data science with the University of Washington's Specialization in Data Science at Scale. Gain hands-on experience with scalable SQL and NoSQL data management solutions, data mining algorithms, and practical statistical and machine learning concepts.
Designed to equip learners with the skills to visualize data, communicate results, and address legal and ethical issues related to big data, this Specialization culminates in a real-world Capstone Project in partnership with Coursolve, a digital internship platform.
Certificate Available ✔
This Specialization covers scalable data manipulation, predictive analytics, effective communication of data science results, and a real-world Capstone Project in partnership with Coursolve.
Data analysis is the cornerstone of evidence-based decision making, and this module focuses on scalable data manipulation systems and algorithms. Gain insight into practical systems derived from the frontier of computer science research, and learn to "think" in MapReduce so you can write algorithms that scale. Explore the landscape of specialized Big Data systems for graphs, arrays, and streams.
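As a rough illustration of what "thinking" in MapReduce can mean, here is a minimal single-machine sketch of the classic word-count pattern. It is not taken from the course materials; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample documents are hypothetical, and a real framework would distribute each phase across many machines.

```python
from collections import defaultdict


def map_phase(document):
    """Emit a (word, 1) pair for every word in one document."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group values by key, standing in for the framework's shuffle step."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values emitted for one key into a single count."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce in sequence on a list of strings."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


# Hypothetical input, used only for illustration.
counts = word_count(["big data at scale", "data science at scale"])
print(counts)
```

The point of the pattern is that `map_phase` and `reduce_phase` each look at one record or one key at a time, which is what lets a framework run them in parallel.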
Statistical experiment design and analytics are central to data science. This module covers the design of statistical experiments, resampling methods, and a core set of practical machine learning methods and concepts. Additionally, it delves into the common idioms of large-scale graph analytics.
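To make "resampling methods" concrete, the sketch below shows one common example, the percentile bootstrap for a confidence interval on a mean. It is an assumption about the kind of method meant, not the course's own code; the function name `bootstrap_mean_ci`, its parameters, and the sample values are all hypothetical.

```python
import random
import statistics


def bootstrap_mean_ci(sample, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile-bootstrap confidence interval for the mean of `sample`."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    n = len(sample)
    # Resample with replacement, recording the mean of each resample.
    means = sorted(
        statistics.fmean(rng.choices(sample, k=n)) for _ in range(n_resamples)
    )
    # Read the interval endpoints off the sorted resampled means.
    lower = means[int((alpha / 2) * n_resamples)]
    upper = means[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper


# Hypothetical measurements, used only for illustration.
data = [2.1, 2.5, 1.9, 3.2, 2.8, 2.4, 3.0, 2.2, 2.7, 2.6]
low, high = bootstrap_mean_ci(data)
print(f"95% CI for the mean: [{low:.2f}, {high:.2f}]")
```

The appeal of the approach is that it needs no distributional formula: the spread of the resampled means stands in for the sampling distribution.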
This module emphasizes the importance of effective visualization, ethical considerations around big data, and the use of cloud computing for reproducible data analysis. Gain insight into the state-of-the-art in privacy, ethics, and governance related to big data and data science.
In the Capstone Project, students take on a real-world project that requires skills from the entire data science pipeline. Through a collaboration with Coursolve, students work on projects for partner stakeholders and gain practical data science experience.
Create an interactive 3D representation of a SARS-CoV-2 protein using Biopython in this hands-on project, providing insights into bioinformatics and medical research...
Learn to store and query data in Google Cloud Datastore using the Google Cloud Platform.
Linear Regression for Business Statistics teaches you to apply various procedures in Microsoft Excel. Learn to build, estimate, and interpret linear regression models...
Learn to build a Support Vector Machine for classification using scikit-learn and the Radial Basis Function (RBF) Kernel.