At the end of the course, you will be able to:
* Retrieve data from example database and big data management systems
* Describe the connections between data management operations and the big data processing patterns needed to utilize them in large-scale analytical applications
* Identify when a big data problem needs data integration
* Execute simple big data integration and processing on Hadoop and Spark platforms

This course is for those new to data science. Completion of Intro to Big Data is recommended. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments. Refer to the specialization technical requirements for complete hardware and software specifications.

Hardware Requirements: (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free. How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking “About This Mac.” Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements. You will need a high-speed internet connection because you will be downloading files up to 4 GB in size.

Software Requirements: This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge (except for data charges from your internet provider). Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+ or CentOS 6+; VirtualBox 5+....



Mar 06, 2018

It was a good course. It could have been better if some Spark examples were also provided in other languages such as Java; people without a Python background may find it difficult.


Dec 08, 2016

The assessments are quite tough compared to the course examples. Moreover, some programming basics should be given or made to understand, especially in Spark, as these are very


by Manoj D

Oct 29, 2018

The course content is good; however, the main issue is with the hands-on and assignment instructions - they are not completely clear and leave out many things.

by Brian M

Oct 14, 2018

The lectures are so mundane and abstract. I enjoy the hands-on portions, but they are so disconnected from the lectures where the concepts are supposed to be explained that they ultimately feel like we're just executing code from the material that we may not fully understand. By the time the quizzes come up, we're expected to put everything together as if we're regularly practicing the methods used in the hands-on portions. Very disappointed.

by Ana L

Sep 03, 2018

A lot of the instructions for the assignments were incomplete or just wrong. The installation for Splunk never worked as written, forums were of no help; I ended up downloading an older version of Splunk just to complete that week's assignment. This course was extremely frustrating to complete.

by Dwayne D

Jul 27, 2018

This course provides a good overview and positioning of relevant big data technologies. In the latter weeks, the hands-on exercises become increasingly challenging, which is a good thing. I have a much better grasp of Apache Spark and its role in big data processing and integration as a result of this course.

My only significant complaint (and why I rated 4 stars vs. 5) is that the setup instructions for the environment needed for the hands-on exercises need to be updated. I spent 1.5 days (in terms of time I allocate to continued learning) struggling with configuration of the final exercise. The forum was useful. It appears most learners who take this course after June 2018 will run into the same issue. The course administrators should update the instructions so that others won't lose time or, worse, give up on the course because of issues indirectly (at best) related to the learning objectives.

All in all, this is a very good course; and I'd recommend it.

by Roberto G C

Jul 23, 2018

Great course and content; I just wish I had felt better prepared for the last challenge with Spark in week 6. But still an excellent course and hands-on exercises.

by Prospero-Martin R

Jun 09, 2018

I did like the subjects in this course! But it would be highly appreciated, and would bring more potential students to this course, if the setup instructions were clearer and more to the point. Students come from various disciplinary backgrounds and 100% of us are new to Big Data tools, so troubleshooting an issue while trying to do a lab or quiz is very stressful and discouraging. Even better, PLEASE add a Troubleshooting section to a week's course should you see that many students are stuck with the same or similar issue...PLEASE!

by Federico S

Apr 13, 2018

I liked the subject, and I achieved my main goal with this course. So I am globally satisfied.

But the last week's assignments were likely above the knowledge acquired through the lessons. It was hard to succeed at them, especially in some steps.

E.g., it doesn't make much sense to spend a long time looking for the correct syntax of the desc function to sort a table in descending order.

by Scott M

Mar 19, 2018

I found the 6th week of this course to be frustrating. There was a big jump from the lessons to the final 2 tests, and the questions and directions were not well worded and a bit confusing. The biggest issue was that many students had the same questions that were blocking their progress; however, there were almost no replies from teachers or staff to give some guidance, tips, etc. Some of these questions were asked over a year ago and the new students had the same questions again, and still no real activity from the teachers in the forums. In past classes that always happened.

by Sam M

Feb 18, 2018

The final assignment contained concepts that were not taught in the course: for example, how to remove leading space from a field, how to put 2 words in a tuple, how to filter lines/texts with null, how to deal with country names with more than one word (e.g. United States), etc. The final assignment required far more advanced Spark programming skills than what was taught in the rest of the course.

by King W N

Jan 05, 2018

This course was a lot more challenging. The assignments require some knowledge of scripting; the practice exercises run through some of it, but if you do not have the background it is very challenging. But don't give up: since you have the VM running on your local machine, you can continue to practice with the course examples.

by Charles G

Dec 15, 2017

It was a tough last project and I had to use outside resources with limited examples and that is why I could not finish on time. I don't like being late on anything but this was by far my worst time management due to vague instructions and lack of supp

by Alireza A B

Oct 22, 2017

Lecture material and instructions are very limited and confusing. There are so many places where the order of the steps to perform certain tasks is flipped, making students spend several pointless hours. I wish somebody would care and review the material and fix all these issues!!!

by Jason R

Oct 21, 2017

This course continued the trend of this specialization, where the lectures are full of vague jargon/diagrams and name-dropping of various applications without teaching us practical skills, and then quiz us on whether we listened to the video verbatim as opposed to challenging our minds conceptually. Only the exercises are redeeming in giving some useful, hands-on experience with some applications, but then the final project required extensive googling to figure out how to work with PySpark DataFrames, which weren't taught in the course. Instead, this course should have just been full of hands-on teaching of PySpark, MongoDB, and Python. Also, the Splunk module was a total tangent/distraction and should be dropped.

by Phillip M

Sep 04, 2017

I have to give this course a low rating, simply because the week 6 assignment "Analysis using Spark" was a terrible experience. All other assignments throughout this course have been great, but the "Analysis using Spark" assignment was poorly constructed. Essentially the assignment could not be completed as prescribed in the instructions. The data required modifying in order to complete the exercise - which I was never able to complete. The goal of the exercise was to use what we learned from the lessons and work with data frames, not deal with and repair broken CSV data. This was extremely disappointing!

by Luis A R

Jun 14, 2019

Excellent course, very good material.

by Brian S

May 23, 2019

The practice activity files are outdated and a lot of the installation of downloaded tools requires manual fixing; there is no support at all from the course publishers.

by Joaquim P

May 14, 2019

I think that this course doesn't provide substantial value to the student. It's basically a series of theoretical videos with irrelevant exercises that the student doesn't even have to think about. It's only about copy and paste until the last assignment. Until then, it's just a waste of time. Obviously it will be a good course for those people who only want the certificate and to pass the course with no effort at all, but it provides no value. On top of this, there is no technical support and I have struggled a lot in order to make everything work properly. I also suggest that Coursera give some guidance on the last assignment; there are a lot of lost people.

by Guilherme D C T

May 13, 2019

The final project is a bit tough but worth it. If you manage to finish it you'll have a new understanding of Spark RDDs and DataFrames.

by Swapnil D

May 12, 2019

The material is presented well.

by Andres H

May 11, 2019

That's an excellent course. I've learned a lot about not just the platform basics but also how to perform basic operations in Mongo. The tests and the practical exercises were also well planned to ensure that you know what you are doing.

by KAY A

May 07, 2019

It is very difficult when the environments don't work. This course has been very difficult to navigate.

by Ranjan K G

Apr 28, 2019

Basically a good course to start learning MongoDB and Spark.

by dstart

Apr 26, 2019

Very good introductory course. It makes you want to continue to learn about Spark and MongoDB.

by Rozina S

Apr 22, 2019

It would be really helpful if there were a full-time teaching assistant whom we could directly contact for queries, since questions on the forum often go unanswered.

by Andreas D S

Apr 21, 2019

Installation for PySpark is not working properly.