Data Mining Project

Note: You should complete all the other courses in this Specialization before beginning this course.

This six-week long Project course of the Data Mining Specialization will allow you to apply the learned algorithms and techniques for data mining from the previous courses in the Specialization, including Pattern Discovery, Clustering, Text Retrieval, Text Mining, and Visualization, to solve interesting real-world data mining challenges. Specifically, you will work on a restaurant review data set from Yelp and use all the knowledge and skills you’ve learned from the previous courses to mine this data set to discover interesting and useful knowledge.

The design of the Project emphasizes:

1) simulating the workflow of a data miner in a real job setting;
2) integrating different mining techniques covered in multiple individual courses;
3) experimenting with different ways to solve a problem to deepen your understanding of techniques; and
4) allowing you to propose and explore your own ideas creatively.

The goal of the Project is to analyze and mine a large Yelp review data set to discover useful knowledge to help people make decisions in dining. The project will include the following outputs:

1. Opinion visualization: explore and visualize the review content to understand what people have said in those reviews.
2. Cuisine map construction: mine the data set to understand the landscape of different types of cuisines and their similarities.
3. Discovery of popular dishes for a cuisine: mine the data set to discover the common/popular dishes of a particular cuisine.
4. Recommendation of restaurants to help people decide where to dine: mine the data set to rank restaurants for a specific dish and predict the hygiene condition of a restaurant.

From the perspective of users, a cuisine map can help them understand what cuisines are there and see the big picture of all kinds of cuisines and their relations. Once they decide what cuisine to try, they would be interested in knowing what the popular dishes of that cuisine are and decide what dishes to have. Finally, they will need to choose a restaurant. Thus, recommending restaurants based on a particular dish would be useful. Moreover, predicting the hygiene condition of a restaurant would also be helpful.

By working on these tasks, you will gain experience with a typical workflow in data mining that includes data preprocessing, data exploration, data analysis, improvement of analysis methods, and presentation of results. You will have an opportunity to combine multiple algorithms from different courses to complete a relatively complicated mining task and experiment with different ways to solve a problem to understand the best way to solve it. We will suggest specific approaches, but you are highly encouraged to explore your own ideas since open exploration is, by design, a goal of the Project.

You are required to submit a brief report for each of the tasks for peer grading. A final consolidated report is also required, which will be peer-graded.

OpenCourser is an affiliate partner of Coursera and may earn a commission when you buy through our links.

Rating: 4.0 based on 5 ratings
Length: 7 weeks
Starts: Jul 10 (42 weeks ago)
Cost: $79
From: University of Illinois at Urbana-Champaign via Coursera
Instructors: Jiawei Han, John C. Hart, ChengXiang Zhai
Download Videos: On all desktop and mobile devices
Language: English
Subjects: Programming, Data Science
Tags: Computer Science, Data Science, Data Analysis, Software Development

What people are saying

Course content is excellent, but lack of support.

Nobody from UIUC responds, and nobody has in what seems like a year.

The project helped me practice the whole specialization's algorithms and techniques.

Sloppy final project, missing submission links.

Sections are no longer consistent.

Very good course!

Careers

An overview of related careers and their average salaries in the US.

Set Designer Trainee $44k

Drum Set Instructor $52k

Assistant Set Medic $56k

set up 2 $61k

set and costume designer $64k

On-Set Makeup Artist $72k

On Set Photographer, DP $74k

Set Carpenter $74k

On-Set Motion Picture Set Painter $75k

Set Up Man $95k

SET Lead $114k

SET Manager $115k
