Data Mining Methods

Data Mining Methods

This course is part of Data Mining Foundations and Practice Specialization

Taught in English

Some content may not be translated

Instructor: Qin (Christine) Lv

4,906 already enrolled

Included with Coursera Plus

Learn more

Course

Gain insight into a topic and learn the fundamentals

3.9

(17 reviews)

Intermediate level

Recommended experience

24 hours (approximately)

Flexible schedule

Learn at your own pace

Progress towards a degree

Learn more

View course modules

What you'll learn

Identify the core functionalities of data modeling in the data mining pipeline
Apply techniques that can be used to accomplish the core functionalities of data modeling and explain how they work.
Evaluate data modeling techniques, determine which is most suitable for a particular task, and identify potential improvements.

Skills you'll gain

Details to know

Shareable certificate

Add to your LinkedIn profile

Course

Gain insight into a topic and learn the fundamentals

3.9

(17 reviews)

Intermediate level

Recommended experience

24 hours (approximately)

Flexible schedule

Learn at your own pace

Progress towards a degree

Learn more

View course modules

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

Build your subject-matter expertise

This course is part of the Data Mining Foundations and Practice Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV

Share it on social media and in your performance review

There are 4 modules in this course

This course covers the core techniques used in data mining, including frequent pattern analysis, classification, clustering, outlier analysis, as well as mining complex data and research frontiers in the data mining field.

This course can be taken for academic credit as part of CU Boulder’s MS in Data Science or MS in Computer Science degrees offered on the Coursera platform. These fully accredited graduate degrees offer targeted courses, short 8-week sessions, and pay-as-you-go tuition. Admission is based on performance in three preliminary courses, not academic history. CU degrees on Coursera are ideal for recent graduates or working professionals. Learn more: MS in Data Science: https://www.coursera.org/degrees/master-of-science-data-science-boulder MS in Computer Science: https://coursera.org/degrees/ms-computer-science-boulder Course logo image courtesy of Lachlan Cormie, available here on Unsplash: https://unsplash.com/photos/jbJp18srifE

This week starts with an overview of this course, Data Mining Methods, then focuses on frequent pattern analysis, including the Apriori algorithm and FP-growth algorithm for frequent itemset mining, as well as association rules and correlation analysis.

What's included

15 videos3 readings1 programming assignment1 discussion prompt

15 videosTotal 152 minutes

Meet Your Instructor!4 minutesPreview module
Preparing for this Specialization9 minutes
Data Mining: Technique View11 minutes
Frequent Pattern Analysis, Apriori Algorithm 12 minutes
Apriori Algorithm Example, Details9 minutes
Example: Apriori Frequent Pattern Analysis8 minutes
Apriori Algorithm Challenges and Improvements13 minutes
FP-growth Algorithm, Example16 minutes
Association Rule, Example8 minutes
Correlation, Example13 minutes
Other Correlation Measures10 minutes
Example: FP-growth Frequent Pattern Analysis9 minutes
Example: Monotonic and Anti-monotonic Constraints9 minutes
Example: Lift Correlation7 minutes
Example: X^2 Correlation7 minutes

3 readingsTotal 30 minutes

Earn Academic Credit for Your Work! 10 minutes
Course Support10 minutes
About This Course10 minutes

1 programming assignmentTotal 240 minutes

Frequent Pattern Analysis240 minutes

1 discussion promptTotal 10 minutes

Introduce Yourself!10 minutes

This week introduces supervised learning, classification, prediction, and covers several core classification methods including decision tree induction, Bayesian classification, support vector machines, neural networks, and ensemble methods. It also discusses classification model evaluation and comparison.

What's included

9 videos1 programming assignment

9 videosTotal 125 minutes

Introduction to Classification 15 minutesPreview module
Decision Tree Induction, Example20 minutes
Bayesian Classification, Example17 minutes
Example: Decision Tree Induction Classification 9 minutes
Example: Bayesian Classification8 minutes
Support Vector Machines9 minutes
Neural Network 12 minutes
Ensemble, Model Evaluation20 minutes
Model Selection13 minutes

1 programming assignmentTotal 240 minutes

Classification 240 minutes

This week introduces you to unsupervised learning, clustering, and covers several core clustering methods including partitioning, hierarchical, grid-based, density-based, and probabilistic clustering. Advanced topics for high-dimensional clustering, bi-clustering, graph clustering, and constraint-based clustering are also discussed.

What's included

8 videos1 reading1 programming assignment

8 videosTotal 106 minutes

Introduction to Clustering11 minutesPreview module
Partitioning Methods16 minutes
Hierarchical and Grid Based Clustering16 minutes
Density-Based Clustering10 minutes
Probabilistic Clustering10 minutes
EM Clustering11 minutes
High Dimensional, Bi-Clustering, Graph Clustering16 minutes
Constraint Based Clustering12 minutes

1 readingTotal 10 minutes

EM Clustering: Further Explanation 10 minutes

1 programming assignmentTotal 240 minutes

Clustering240 minutes

This week discusses three different types of outliers (global, contextual, and collective) and how different methods may be used to identify and analyze such outliers. It also covers some advanced methods for mining complex data, as well as the research frontiers of the data mining field.

What's included

8 videos1 peer review

8 videosTotal 109 minutes

Types of Outliers12 minutesPreview module
Anomaly Detection Methods 115 minutes
Anomaly Detection Methods 216 minutes
Anomaly Detection Examples 12 minutes
Sequence and Time Series Data14 minutes
Graph and Online Social Network Data10 minutes
Web Data, KDD Conference 13 minutes
Data Mining Research Frontiers14 minutes

1 peer reviewTotal 180 minutes

Peer Review: Outlier Analysis, Research Frontiers180 minutes

Instructor

Qin (Christine) Lv

University of Colorado Boulder

3 Courses9,072 learners

Offered by

University of Colorado Boulder

Recommended if you're interested in Data Analysis

University of Colorado Boulder
Data Mining Project
Make progress toward a degree
Course
University of Illinois at Urbana-Champaign
Text Retrieval and Search Engines
Course
University of Colorado Boulder
Data Mining Pipeline
Make progress toward a degree
Course
University of Illinois at Urbana-Champaign
Text Mining and Analytics
Course

Get a head start on your degree

This course is part of the following degree programs offered by University of Colorado Boulder. If you are admitted and enroll, your coursework can count toward your degree learning and your progress can transfer with you.

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

New to Data Analysis? Start here.

Open new doors with Coursera Plus

Unlimited access to 7,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Learn more

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Explore degrees

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Learn more

Frequently asked questions

A cross-listed course is offered under two or more CU Boulder degree programs on Coursera. For example, Dynamic Programming, Greedy Algorithms is offered as both CSCA 5414 for the MS-CS and DTSA 5503 for the MS-DS.

· You may not earn credit for more than one version of a cross-listed course.

· You can identify cross-listed courses by checking your program’s student handbook.

· Your transcript will be affected. Cross-listed courses are considered equivalent when evaluating graduation requirements. However, we encourage you to take your program's versions of cross-listed courses (when available) to ensure your CU transcript reflects the substantial amount of coursework you are completing directly in your home department. Any courses you complete from another program will appear on your CU transcript with that program’s course prefix (e.g., DTSA vs. CSCA).

· Programs may have different minimum grade requirements for admission and graduation. For example, the MS-DS requires a C or better on all courses for graduation (and a 3.0 pathway GPA for admission), whereas the MS-CS requires a B or better on all breadth courses and a C or better on all elective courses for graduation (and a B or better on each pathway course for admission). All programs require students to maintain a 3.0 cumulative GPA for admission and graduation.

Yes. Cross-listed courses are considered equivalent when evaluating graduation requirements. You can identify cross-listed courses by checking your program’s student handbook.

You may upgrade and pay tuition during any open enrollment period to earn graduate-level CU Boulder credit for << this course/ courses in this specialization>>. Because << this course is / these courses are >> cross listed in both the MS in Computer Science and the MS in Data Science programs, you will need to determine which program you would like to earn the credit from before you upgrade.

MS in Data Science (MS-DS) Credit: To upgrade to the for-credit data science (DTSA) version of << this course / these courses >>, use the MS-DS enrollment form. See How It Works.

MS in Computer Science (MS-CS) Credit: To upgrade to the for-credit computer science (CSCA) version of << this course / these courses >>, use the MS-CS enrollment form. See How It Works.

If you are unsure of which program is the best fit for you, review the MS-CS and MS-DS program websites, and then contact datascience@colorado.edu or mscscoursera-info@colorado.edu if you still have questions.

Access to lectures and assignments depends on your type of enrollment. If you take a course in audit mode, you will be able to see most course materials for free. To access graded assignments and to earn a Certificate, you will need to purchase the Certificate experience, during or after your audit. If you don't see the audit option:

The course may not offer an audit option. You can try a Free Trial instead, or apply for Financial Aid.
The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.

Data Mining Methods

Course

What you'll learn

Skills you'll gain

Details to know

Course

See how employees at top companies are mastering in-demand skills

Build your subject-matter expertise

Earn a career certificate

There are 4 modules in this course

Frequent Pattern Analysis

What's included

Classification

What's included

Clustering

What's included

Outlier Analysis

What's included

Instructor

Offered by

Recommended if you're interested in Data Analysis

Data Mining Project

Text Retrieval and Search Engines

Data Mining Pipeline

Text Mining and Analytics

Get a head start on your degree

Master of Science in Data Science

Master of Science in Computer Science

Why people choose Coursera for their career

New to Data Analysis? Start here.

Open new doors with Coursera Plus

Advance your career with an online degree

Join over 3,400 global companies that choose Coursera for Business

Frequently asked questions

What is a cross-listed course?

Can I take cross-listed courses to fulfill my degree requirements?

How do I upgrade and earn credit from CU Boulder?

More questions