Unsupervised Algorithms in Machine Learning

This course is part of the Machine Learning: Theory and Hands-on Practice with Python Specialization

Taught in English

Instructor: Geena Kim

3,285 already enrolled

Course

Gain insight into a topic and learn the fundamentals

Intermediate level

38 hours (approximately)

Flexible schedule

Learn at your own pace

Progress towards a degree

What you'll learn

Explain what unsupervised learning is, and list methods used in unsupervised learning.
List and explain algorithms for various matrix factorization methods, and what each is used for.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

6 quizzes

Build your subject-matter expertise

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV

Share it on social media and in your performance review

There are 4 modules in this course

One of the most useful areas in machine learning is discovering hidden patterns from unlabeled data. Add the fundamentals of this in-demand skill to your Data Science toolkit. In this course, we will learn selected unsupervised learning methods for dimensionality reduction, clustering, and learning latent features. We will also focus on real-world applications such as recommender systems with hands-on examples of product recommendation algorithms.

Prior coding or scripting knowledge is required. We will be utilizing Python extensively throughout the course. College-level math skills, including Calculus and Linear Algebra, are needed. It is recommended, but not required, to take the first course in the specialization, Introduction to Machine Learning: Supervised Learning.

This course can be taken for academic credit as part of CU Boulder’s MS in Data Science or MS in Computer Science degrees offered on the Coursera platform. These fully accredited graduate degrees offer targeted courses, short 8-week sessions, and pay-as-you-go tuition. Admission is based on performance in three preliminary courses, not academic history. CU degrees on Coursera are ideal for recent graduates or working professionals.

Learn more:
MS in Data Science: https://www.coursera.org/degrees/master-of-science-data-science-boulder
MS in Computer Science: https://coursera.org/degrees/ms-computer-science-boulder

Course logo image by Ryan Wallace on Unsplash.

Unsupervised Learning Intro

Now that you have a solid foundation in Supervised Learning, we shift our attention to uncovering the hidden structure in unlabeled data. We will start with an introduction to Unsupervised Learning. In this course, the models no longer have labels to learn from. They need to make sense of the data from the observations themselves. This week we are diving into Principal Component Analysis (PCA), a foundational dimensionality reduction technique. When you first start learning this topic, it might not seem easy. There is undoubtedly some math involved in this section. However, PCA can be grasped conceptually, perhaps more readily than anticipated. In the Supervised Learning course, we struggled with the Curse of Dimensionality. This week, we will see how PCA can reduce the number of dimensions and help with classification and regression tasks. You will have reading, a quiz, and a Jupyter notebook lab/Peer Review to implement the PCA algorithm.
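
To make the idea of PCA concrete, here is a minimal sketch, not the course's own lab code. It assumes NumPy is available and computes the principal components from the singular value decomposition of the mean-centered data. The function name `pca` and the synthetic data are made up for illustration.

```python
import numpy as np

def pca(X, n_components):
    """Project X (n_samples x n_features) onto its top principal components."""
    X = np.asarray(X, dtype=float)
    # Center each feature so the components describe variance, not the mean
    X_centered = X - X.mean(axis=0)
    # Rows of Vt are the principal directions, ordered by singular value
    _, S, Vt = np.linalg.svd(X_centered, full_matrices=False)
    components = Vt[:n_components]
    # Variance captured along each direction, as a fraction of the total
    explained_variance = S ** 2 / (X.shape[0] - 1)
    ratio = explained_variance[:n_components] / explained_variance.sum()
    # Low-dimensional representation of every observation
    scores = X_centered @ components.T
    return scores, components, ratio

# Hypothetical data: 200 points lying close to a single line in 3-D
rng = np.random.default_rng(0)
t = rng.normal(size=200)
X = np.column_stack([t, 2 * t, -t]) + 0.05 * rng.normal(size=(200, 3))

scores, components, ratio = pca(X, n_components=1)
print(scores.shape, ratio)
```

Because the synthetic points vary almost entirely along one direction, a single component should capture nearly all of the variance, which is the situation where dimensionality reduction loses very little information.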

What's included

3 videos, 10 readings, 3 quizzes, 1 peer review, 2 discussion prompts, 1 ungraded lab

3 videos (total 33 minutes)

Unsupervised Learning Introduction (15 minutes)
Intuition (7 minutes)
How it works (10 minutes)

10 readings (total 75 minutes)

Earn Academic Credit for your Work! (10 minutes)
Course Support (10 minutes)
Pre-Requisite Course Knowledge (1 minute)
Course Textbooks (5 minutes)
Things of Note for Programming Assignments (2 minutes)
Information on Peer Reviews (0 minutes)
Honor Code (5 minutes)
Module 1 Slides (10 minutes)
ISLR 12.2: Principal Component Analysis (30 minutes)
Fashion-MNIST Dataset (2 minutes)

3 quizzes (total 45 minutes)

Programming Assignments Quiz (5 minutes)
Honor Code Quiz (10 minutes)
Week 1 Quiz (30 minutes)

1 peer review (total 180 minutes)

Week 1: PCA (180 minutes)

2 discussion prompts (total 20 minutes)

USL Introduction (10 minutes)
Peer Review Expectations (10 minutes)

1 ungraded lab (total 180 minutes)

Week 1: PCA (180 minutes)

Clustering

This week, we are working with clustering, one of the most popular unsupervised learning methods. Last week, we used PCA to find a low-dimensional representation of data. Clustering, on the other hand, finds subgroups among observations. We can get a meaningful intuition of the data structure or use a procedure like Cluster-then-predict. Clustering has many applications, ranging from customer segmentation in marketing and advertising and identifying similar movies or music to genomics research and the discovery of disease subtypes. We will focus mainly on K-means clustering and hierarchical clustering, considering the benefits and disadvantages of both and choices such as the distance metric and the linkage. We have reading, a quiz, and a Jupyter notebook lab/Peer Review this week.
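
As a rough illustration of how K-means finds subgroups, the sketch below alternates the two steps of Lloyd's algorithm: assign every point to its nearest centroid, then move each centroid to the mean of its points. It assumes NumPy and squared Euclidean distance. The function name `kmeans`, the synthetic blobs, and the hand-picked starting centroids are illustrative assumptions, not course material; practical implementations usually choose starting centroids at random or with k-means++.

```python
import numpy as np

def kmeans(X, init_centroids, n_iter=100):
    """Lloyd's algorithm with squared Euclidean distance."""
    X = np.asarray(X, dtype=float)
    centroids = np.asarray(init_centroids, dtype=float).copy()
    for _ in range(n_iter):
        # Assignment step: distance from every point to every centroid
        dists = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # Update step: each centroid becomes the mean of its members
        new_centroids = centroids.copy()
        for k in range(len(centroids)):
            members = X[labels == k]
            if len(members) > 0:  # keep the old centroid if a cluster is empty
                new_centroids[k] = members.mean(axis=0)
        if np.allclose(new_centroids, centroids):
            break  # assignments have stabilized
        centroids = new_centroids
    return labels, centroids

# Hypothetical data: two well-separated blobs of 50 points each
rng = np.random.default_rng(0)
blob_a = rng.normal(loc=0.0, scale=0.5, size=(50, 2))
blob_b = rng.normal(loc=10.0, scale=0.5, size=(50, 2))
X = np.vstack([blob_a, blob_b])

# Starting centroids picked by hand here: one point from each blob
labels, centroids = kmeans(X, init_centroids=X[[0, -1]])
print(centroids)
```

The result depends on the starting centroids and on the distance metric, which is one reason the choice of metric matters when comparing K-means with hierarchical clustering.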

What's included

2 videos, 2 readings, 1 quiz, 1 peer review, 1 discussion prompt, 1 ungraded lab

2 videos (total 23 minutes)

Clustering Introduction (10 minutes)
Hierarchical Clustering (12 minutes)

2 readings (total 20 minutes)

Module 2 Slides (10 minutes)
ISLR 12.4: Clustering Methods (10 minutes)

1 quiz (total 60 minutes)

Week 2 Quiz (60 minutes)

1 peer review (total 180 minutes)

Week 2: Clustering (180 minutes)

1 discussion prompt (total 10 minutes)

Week 2: Cluster Metrics (10 minutes)

1 ungraded lab (total 180 minutes)

Week 2: Clustering (180 minutes)

Recommender System

This week we are working with Recommender Systems. Websites like Netflix, Amazon, and YouTube surface personalized recommendations for movies, items, or videos. This week, we explore the strategies Recommendation Engines use to predict what users will like. We will consider popularity-based, content-based, and collaborative filtering approaches, and which similarity metrics to use. Working with Recommendation Systems brings challenges, such as the time complexity of operations and sparse data. This week is relatively math dense. You will have a quiz in which you work through different similarity metric calculations. Give yourself time for this week's Jupyter notebook lab and consider performant implementations. The Peer Review section this week is short.
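
The similarity calculations mentioned above can be sketched in a few lines of plain Python. This is only an illustration of two common measures, cosine similarity on rating vectors and Jaccard similarity on sets of rated items. The rating vectors are invented, and treating 0 as "not rated" is a simplifying assumption rather than something the course prescribes.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length rating vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # similarity is undefined for an all-zero vector; use 0 here
    return dot / (norm_a * norm_b)

def jaccard_similarity(items_a, items_b):
    """Size of the intersection divided by size of the union of two item sets."""
    items_a, items_b = set(items_a), set(items_b)
    union = items_a | items_b
    if not union:
        return 0.0
    return len(items_a & items_b) / len(union)

# Hypothetical ratings for four items; 0 is assumed to mean "not rated"
user_a = [5, 3, 0, 1]
user_b = [4, 0, 0, 1]

sim_cos = cosine_similarity(user_a, user_b)

# Jaccard ignores the rating values and only compares which items were rated
rated_a = {i for i, r in enumerate(user_a) if r > 0}
rated_b = {i for i, r in enumerate(user_b) if r > 0}
sim_jac = jaccard_similarity(rated_a, rated_b)

print(sim_cos, sim_jac)
```

Computing such a similarity for every pair of users or items grows quadratically with their number, which hints at why time complexity and sparse data become practical concerns at scale.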

What's included

4 videos, 1 reading, 1 quiz, 1 programming assignment, 1 peer review

4 videos (total 37 minutes)

Recommender System Introduction (11 minutes)
Similarity Measures (7 minutes)
Calculating Similarity Examples (13 minutes)
Recommender Systems in Large Scale (4 minutes)

1 reading (total 10 minutes)

Module 3 Slides (10 minutes)

1 quiz (total 90 minutes)

Week 3 Quiz (90 minutes)

1 programming assignment (total 300 minutes)

Week 3: Recommender Systems (300 minutes)

1 peer review (total 30 minutes)

Week 3: Recommender Systems (30 minutes)

Matrix Factorization

We are already at the last week of course material! Get ready for another dense math week. Last week, we learned about Recommendation Systems. We used a Neighborhood Method of Collaborative Filtering, utilizing similarity measures. Latent Factor Models, including the popular Matrix Factorization (MF), can also be used for Collaborative Filtering. A 1999 publication in Nature made Non-negative Matrix Factorization extremely popular. MF has many applications, including image analysis, text mining and topic modeling, recommender systems, audio signal separation, analytical chemistry, and gene expression analysis. This week, we focus on Singular Value Decomposition, Non-negative Matrix Factorization, and approximation methods. We have reading, a quiz, and a Kaggle mini-project that uses matrix factorization to categorize news articles.
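
As one possible illustration of Non-negative Matrix Factorization, the sketch below approximates a non-negative matrix V by a product W H of two smaller non-negative matrices, using the well-known multiplicative update rules for the squared (Frobenius) reconstruction error. It assumes NumPy. The function name `nmf`, the iteration count, and the synthetic matrix are illustrative choices and are not taken from the course materials.

```python
import numpy as np

def nmf(V, rank, n_iter=500, eps=1e-10, seed=0):
    """Factor a non-negative matrix V (n x m) as W (n x rank) times H (rank x m)."""
    V = np.asarray(V, dtype=float)
    rng = np.random.default_rng(seed)
    n, m = V.shape
    # Start from strictly positive random factors
    W = rng.random((n, rank)) + eps
    H = rng.random((rank, m)) + eps
    for _ in range(n_iter):
        # Multiplicative updates keep every entry non-negative;
        # eps guards against division by zero
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Hypothetical data: a 6 x 5 non-negative matrix built from two latent factors
rng = np.random.default_rng(1)
V = rng.random((6, 2)) @ rng.random((2, 5))

W, H = nmf(V, rank=2)
rel_error = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
print(rel_error)
```

In a topic-modeling setting, V would typically be a document-term matrix, with the rows of H read as topics and the rows of W as each document's weights on those topics.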

What's included

5 videos, 1 reading, 1 quiz, 1 peer review

5 videos (total 54 minutes)

Matrix Factorization Introduction (11 minutes)
Eigen Decomposition (10 minutes)
Singular Value Decomposition (9 minutes)
Non-negative Matrix Factorization (15 minutes)
NMF Optimization (7 minutes)

1 reading (total 10 minutes)

Module 4 Slides (10 minutes)

1 quiz (total 30 minutes)

Week 4 Quiz (30 minutes)

1 peer review (total 720 minutes)

Week 4: BBC News Classification Kaggle Mini-Project (720 minutes)

Instructor

Geena Kim

University of Colorado Boulder

3 Courses, 19,876 learners

Offered by

University of Colorado Boulder

Get a head start on your degree

This course is part of the following degree programs offered by University of Colorado Boulder. If you are admitted and enroll, your coursework can count toward your degree and your progress can transfer with you.

Master of Science in Computer Science
Master of Science in Data Science

Frequently asked questions

What is a cross-listed course?

A cross-listed course is offered under two or more CU Boulder degree programs on Coursera. For example, Dynamic Programming, Greedy Algorithms is offered as both CSCA 5414 for the MS-CS and DTSA 5503 for the MS-DS.

· You may not earn credit for more than one version of a cross-listed course.

· You can identify cross-listed courses by checking your program’s student handbook.

· Your transcript will be affected. Cross-listed courses are considered equivalent when evaluating graduation requirements. However, we encourage you to take your program's versions of cross-listed courses (when available) to ensure your CU transcript reflects the substantial amount of coursework you are completing directly in your home department. Any courses you complete from another program will appear on your CU transcript with that program’s course prefix (e.g., DTSA vs. CSCA).

· Programs may have different minimum grade requirements for admission and graduation. For example, the MS-DS requires a C or better on all courses for graduation (and a 3.0 pathway GPA for admission), whereas the MS-CS requires a B or better on all breadth courses and a C or better on all elective courses for graduation (and a B or better on each pathway course for admission). All programs require students to maintain a 3.0 cumulative GPA for admission and graduation.

Can I take cross-listed courses to fulfill my degree requirements?

Yes. Cross-listed courses are considered equivalent when evaluating graduation requirements. You can identify cross-listed courses by checking your program’s student handbook.

How do I upgrade and earn credit from CU Boulder?

You may upgrade and pay tuition during any open enrollment period to earn graduate-level CU Boulder credit for this course. Because this course is cross-listed in both the MS in Computer Science and the MS in Data Science programs, you will need to determine which program you would like to earn the credit from before you upgrade.

MS in Data Science (MS-DS) Credit: To upgrade to the for-credit data science (DTSA) version of this course, use the MS-DS enrollment form. See How It Works.

MS in Computer Science (MS-CS) Credit: To upgrade to the for-credit computer science (CSCA) version of this course, use the MS-CS enrollment form. See How It Works.

If you are unsure of which program is the best fit for you, review the MS-CS and MS-DS program websites, and then contact datascience@colorado.edu or mscscoursera-info@colorado.edu if you still have questions.

Access to lectures and assignments depends on your type of enrollment. If you take a course in audit mode, you will be able to see most course materials for free. To access graded assignments and to earn a Certificate, you will need to purchase the Certificate experience, during or after your audit. If you don't see the audit option:

The course may not offer an audit option. You can try a Free Trial instead, or apply for Financial Aid.
The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.
