Welcome to the Capstone Project for Big Data! In this culminating project, you will build a big data ecosystem using tools and methods form the earlier courses in this specialization. You will analyze a data set simulating big data generated from a large number of users who are playing our imaginary game "Catch the Pink Flamingo". During the five week Capstone Project, you will walk through the typical big data science steps for acquiring, exploring, preparing, analyzing, and reporting. In the first two weeks, we will introduce you to the data set and guide you through some exploratory analysis using tools such as Splunk and Open Office. Then we will move into more challenging big data problems requiring the more advanced tools you have learned including KNIME, Spark's MLLib and Gephi. Finally, during the fifth and final week, we will show you how to bring it all together to create engaging and compelling reports and slide presentations. As a result of our collaboration with Splunk, a software company focus on analyzing machine-generated big data, learners with the top projects will be eligible to present to Splunk and meet Splunk recruiters and engineering leadership.
提供方
課程信息
您將獲得的技能
- Big Data
- Neo4j
- Knime
- Splunk
提供方

加州大学圣地亚哥分校
UC San Diego is an academic powerhouse and economic engine, recognized as one of the top 10 public universities by U.S. News and World Report. Innovation is central to who we are and what we do. Here, students learn that knowledge isn't just acquired in the classroom—life is their laboratory.
授課大綱 - 您將從這門課程中學到什麼
Simulating Big Data for an Online Game
This week we provide an overview of the Eglence, Inc. Pink Flamingo game, including various aspects of the data which the company has access to about the game and users and what we might be interested in finding out.
Acquiring, Exploring, and Preparing the Data
Next, we begin working with the simulated game data by exploring and preparing the data for ingestion into big data analytics applications.
Data Classification with KNIME
This week we do some data classification using KNIME.
Clustering with Spark
This week we do some clustering with Spark.
Graph Analytics of Simulated Chat Data With Neo4j
This week we apply what we learned from the 'Graph Analytics With Big Data' course to simulated chat data from Catch the Pink Flamingos using Neo4j. We analyze player chat behavior to find ways of improving the game.
審閱
- 5 stars66.06%
- 4 stars21.85%
- 3 stars5.91%
- 2 stars1.79%
- 1 star4.37%
來自大数据 - 毕业项目的熱門評論
Really interesting insights into the general overview of the big data specialization with brain-teasing hands-on exercises and a look to hoe reporting various big data analytics should be undertaken
Thank you Coursera and instructors for creating this course. The structure is very good. Looking forward for completing other specializations too. Thank you!!
The project is really helpful to sum up the whole process of the 5 previous courses, but there is a bit problem with the week 4 assignment.
Very engaging course. Well designed and delivered. I also liked the breadth and depth of the course. Liked it and continue use the material as reference
關於 大数据 專項課程
Drive better business decisions with an overview of how big data is organized, analyzed, and interpreted. Apply your insights to real-world problems and questions.

常見問題
我什么时候能够访问课程视频和作业?
我订阅此专项课程后会得到什么?
有助学金吗?
還有其他問題嗎?請訪問 學生幫助中心。