Save for later

Introduction to Apache Spark and AWS

Heads up! This course may be archived and/or unavailable.

Learn to analyze big data using Apache Spark's distributed computing framework. In a series of focused, practical tasks, you will start by launching a spark cluster on Amazon's EC2 cloud computing platform. As you progress to working with real data, you will gain exposure to a variety of useful tools, including RDFlib and SPARQL. The practical tasks on this course make use of the Gutenberg Project data - the world's largest open collection of ebooks. This offers no end of opportunity for highly engaging and novel analyses. As the taught material and example code is given in Python, it is strongly recommended that all students have previous Python programming experience. Furthermore, launching and interacting with a cluster on EC2 requires basic knowledge of Unix command line, and some experience with a command-line editor such as vim or nano would also be advantageous. With these minimal prerequisites, this course is designed to get you up and running in Spark as quickly and painlessly as possible, so that by the end, you will be comfortable and competent enough to start engineering your own big data solutions.

Get Details and Enroll Now

OpenCourser is an affiliate partner of Coursera and may earn a commission when you buy through our links.

Get a Reminder

Send to:
Rating 2.6 based on 14 ratings
Length 5 weeks
Effort 4 weeks of study, 3-6 hours/week
Starts Aug 27 (296 weeks ago)
Cost $49
From University of London via Coursera
Instructors Dr Sorrel Harriet, Christophe Rhodes
Download Videos On all desktop and mobile devices
Language English
Subjects Data Science Engineering Science
Tags Data Science Data Analysis Physical Science And Engineering Research Methods

Get a Reminder

Send to:

Similar Courses

What people are saying

most video discussions can

Most video discussions can be cover with short lecture notes that can show precise commands, output and etc.

can show precise commands

teaching ive experienced .the

This seems to be the worst teaching Ive experienced .The good thing is i never pay for cousera courses unless I find them educative, so i didn't loose anything but time.

gave 1 point less

On two occasion I felt peers in my class did not follow the rubric to grade my assignment and gave 1 point less.

get an o'reilly book.sorry

Get an O'Reilly book.Sorry, UoL, I appreciate the effort, but Coursera isnt about putting yourself on the map with a MOOC, it is about asking yourself "what does someone following my course take from it?"

ends up searching high

The user ends up searching high and low just to set up and semi-stable environment.

frankly an *autistic* course.no

This is frankly an *autistic* course.No regard for the needs and average technical level of the audience, COMPLETE DISCONNECT between the teaching staff and the proclaimed objective of the course, very poor delivery of the content.If you want to spend a couple of weeks debugging because the instructions are virtually non-existent, or mixed, for various versions of Spark or Python, trying to make sense of misleading cues in the lectures and riding a very steep curve with Spark on EC2 almost alone, go ahead.I only completed the course because I don't like to leave stuff unfinished.

go ahead.i only completed

complete disconnect between

too basic operations

Too basic operations on the Spark, EC2 and S3.

architecture deployment options

Some outdated components It will be much easier to pick up the course materials if started with a brief introduction on Spark architecture deployment options.

average technical level

Careers

An overview of related careers and their average salaries in the US. Bars indicate income percentile.

Volunteer Big Data Engineer $48k

Data Scientist - Big Data $68k

Big Data and AWS Data Lake $73k

Big Data Developer (Streaming Data) $77k

Big data developer with AWS $78k

Research Scientist Big Data $94k

Big Data Developer Consultant $98k

Big Data Engineer 6 $107k

Big data and ETL specialist $121k

Big Data Specialist $149k

Principal Big Data Architect $180k

Senior Big Data Sales $181k

Write a review

Your opinion matters. Tell us what you think.

Rating 2.6 based on 14 ratings
Length 5 weeks
Effort 4 weeks of study, 3-6 hours/week
Starts Aug 27 (296 weeks ago)
Cost $49
From University of London via Coursera
Instructors Dr Sorrel Harriet, Christophe Rhodes
Download Videos On all desktop and mobile devices
Language English
Subjects Data Science Engineering Science
Tags Data Science Data Analysis Physical Science And Engineering Research Methods

Similar Courses

Sorted by relevance

Like this course?

Here's what to do next:

  • Save this course for later
  • Get more details from the course provider
  • Enroll in this course
Enroll Now