Chevron Left
返回到 Big Data Integration and Processing

學生對 加州大学圣地亚哥分校 提供的 Big Data Integration and Processing 的評價和反饋

1,967 個評分
415 條評論


At the end of the course, you will be able to: *Retrieve data from example database and big data management systems *Describe the connections between data management operations and the big data processing patterns needed to utilize them in large-scale analytical applications *Identify when a big data problem needs data integration *Execute simple big data integration and processing on Hadoop and Spark platforms This course is for those new to data science. Completion of Intro to Big Data is recommended. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments. Refer to the specialization technical requirements for complete hardware and software specifications. Hardware Requirements: (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free. How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking “About This Mac.” Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements.You will need a high speed internet connection because you will be downloading files up to 4 Gb in size. Software Requirements: This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge (except for data charges from your internet provider). Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+ or CentOS 6+ VirtualBox 5+....



Mar 06, 2018

It was a good course, it could have been better if some examples of Spark were also provided in other Languages like Java, people without having background of python may find it difficult.


Sep 25, 2016

Best course taking into account the first three. Good material, more in depth than the other ones. Very well explained. Useful to get a sense of various interesting topics and orientative.


326 - Big Data Integration and Processing 的 350 個評論(共 403 個)

創建者 Shalaka M

Oct 16, 2017

I wish that the Spark programming should have been covered in more details as was the MongoDB and Splunk covered.

創建者 Tatiana M

Feb 28, 2017

A little slower than the last ones, not my favorite but great use of hands-on projects and enagagement

創建者 SU C G

Oct 23, 2016

Needs more depth. Instructors should reference more external readings since the lectures are brief.

創建者 Anirudh

Dec 25, 2016

some of the stuff was not mentioned properly, like for last quiz how to expoert data from mongodb

創建者 Pablo A L Z

Jul 28, 2020


創建者 Nester P

Sep 10, 2017

The last assignment of Week 6 was far more advanced than the rest of the material.

創建者 Luís O

Apr 27, 2020

You could focus more on Spark that is the widely software for Big data Processing

創建者 Erik P

Oct 14, 2017

very nice, just wish the environment was using docker instead of virtual machines

創建者 Silvia C R S

Oct 28, 2017

I think that there should be more exercises for MongoDB and Spark assigments.

創建者 Ramathmika

Apr 02, 2020

Not very efficient hands-on practices but overall good learning experience

創建者 Dev A S

Jan 20, 2020

Good course. But we couldn't relate the theoretical videos with hands-on.

創建者 Vadim C

Nov 05, 2016

The final assessment somewhat not really well designed, imho.

創建者 Ashish J

Aug 25, 2017

spark hands on should have been more instructive.


Aug 05, 2019

Good info, just a lot of info to digest.

創建者 Konstantin K

Mar 01, 2018

All is good except the Splunk case

創建者 Ho S J

Jun 22, 2017

Very difficult final exam.

創建者 Brajesh L S

Apr 27, 2020

Tough one.

創建者 Keith B

Mar 31, 2017

The course and presentations were very informational and good. I enjoyed that aspect. I would have rated the course 4-5 star based on that. The reason for the low rating was the 6th module, and the fact that I felt very ill prepared for the syntax of creating all the operations in Spark (building out the Jupiter notebook). We really did not cover much of that, and it was quite punishing to search the web and sources to make things work. Even the instructions to export the csv file were misleading at best. I have a full time job and a family, I am not some young undergraduate with copious amounts of time to waste. While I am not opposed to some searching of other sources, I would like to have more of the useful information taught so that it is not so much of a burden. I believe that if you are going to test people on something, you should at least cover it in some sense.


Mar 03, 2019

With deep regrets I feel obliged to share a negative rating on the course. While the course material/video lectures are average to good (no rocket science but well done introduction into the subjects), the hands-on exercises and particularly the technical environment, i.e. Cloudera VM is entirely messed-up: - setup scripts are not working/ are outdated (e.g., anaconda requires no-check-certificate); user permissions are all set wrong and need to be corrected; firefox outdated with update function not working; countless error around spark context (SC) variables.... and so on... For a course that is so prominently promoted on the platform the least expectation is that the provided environment works and that students don´t need to spend hours on google to figure out how to debug the cloudera image.... Here, imo, a much better job can be done!

創建者 Silvain d M

Aug 23, 2017

Although the contents of the course is good, I found that the hands-on exercises needed to pass tests were problematic due to many errors occurring when trying to setup the tools or running provided scripts. This means most of the limited time I have for this was spend browsing the course forums and the internet chasing solutions for errors occurring in the exercises and not on actually working on the assignments.

Also the course makes you install several tools/apps. In itself it is good to be exposed to these tools, however some of these are only used to a limited extent, while still taking time to install and setup. Worst is one of the tools requiring personal information in order to be downloaded and as a consequence being chased by sales reps for the tool.

創建者 Christoph S

Apr 03, 2020

Again I'm torn between quitting this specialization and biting through the rest of it. While the course is good on the high-level view, the link to the low level, the tools and their application just doesn't work well for me. The different tools are presented and used just enough to scrape a tiny lttle bit of the surface, then you're heading on the next chapter. Like in the previous courses, the tools in the VM sometimes need quite a bit on tinkering until everything works as expected. The main drawdown in this course was the final test that I did not felt prepared for at all. On the bright side, you learn to love the Spark manual...


創建者 Andrew D

Oct 14, 2016

Overall this course does have some good content and delivers big data concepts. However, as others have mentioned some content (especially in early modules) could either be combined or ommited. Key focus areas on Spark and MongoDB are not given enough focus and lab time.

The quizes have badly worded questions. Finally the last assignment required to pass the class has bad directions and covers content not reviewed in the class. Spent a frustrating amount of time trying to get what most likely is simple code to work.

I'm hoping this particular module is revised. For those just interested in learning Spark or Mongo and not doing the certificate program you can probably get better learning from doing your own research.

創建者 Joaquim P

May 14, 2019

I think that this course doesn't provide a substantial value to the student. It's basically a series of theoretical videos with irrelevant exercices that the student doesn't even have to think about. It's only about copy and paste until the last assignment. Until then, it's just a waste of time. Obviously it will be a good course for those people who only want the certificate and to pass the course with no effort at all, but it provides no value. On top of this, there is no technical support and I have struggled a lot in order to make everything work properly. I also suggest Coursera to give some guidance in the last assignment, there is a lot of lost people.

創建者 Ryan H

Jun 12, 2017

Again, another course in this series shows a lack of effort in its quiz construction. By the final week, you are presented with a challenge that will require numerous hours pouring over different documentations of both pyspark and MongoDB because there is a lack of essential knowledge being taught in the course. The final "project" is based on a very small amount of what was learned, and as it so happens, only a small amount of what was needed was actually taught. I'm hoping for improvement with the rest of the course, because the majority of this course was good, but the final week just ruined the experience.

創建者 Guillem P G

Jan 10, 2017

The last assignment of the course is, compared to the others, more difficult. In my case, I ran into several errors which I couldn't get help in solving by using the course Forum, as the end of course deadline was just a few days ahead. I had to analyze the tweet texts for the last graded assignment without using Spark framework (nor any of the other "Big Data" tools explored in the course).

I also found some of the videos by PhD. Amarnath Gupta were difficult to understand, his examples were unclear and, in my opinion, too complex and difficult to follow and understand what was the reasoning thread.