We may earn an affiliate commission when you visit our partners.
Google Cloud Training

This is a self-paced lab that takes place in the Google Cloud console. It shows you how to create a Google Cloud Dataproc cluster, run a simple Apache Spark job on the cluster, and then change the number of workers in the cluster, all from the gcloud command line. Watch these short videos: Dataproc: Qwik Start - Qwiklabs Preview and Run Spark and Hadoop Faster with Cloud Dataproc.
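The three steps the lab describes map onto three gcloud invocations. The sketch below is a hedged illustration, not the lab's own script: it only assembles the command lines and prints them, so it runs without a Google Cloud project. The cluster name, region, worker counts, and the SparkPi example class and jar path are assumptions chosen for illustration; none of them come from this page.

```python
import shlex

# Hypothetical placeholder values; substitute your own cluster name and region.
CLUSTER = "example-cluster"
REGION = "us-central1"

# Assumed example job: the SparkPi sample that ships with Spark on Dataproc images.
SPARK_PI_CLASS = "org.apache.spark.examples.SparkPi"
SPARK_EXAMPLES_JAR = "file:///usr/lib/spark/examples/jars/spark-examples.jar"


def create_cluster(cluster, region, num_workers=2):
    """Build the command that creates a Dataproc cluster."""
    return ["gcloud", "dataproc", "clusters", "create", cluster,
            f"--region={region}", f"--num-workers={num_workers}"]


def submit_spark_pi(cluster, region, tasks=1000):
    """Build the command that submits a simple Spark job to the cluster."""
    return ["gcloud", "dataproc", "jobs", "submit", "spark",
            f"--cluster={cluster}", f"--region={region}",
            f"--class={SPARK_PI_CLASS}", f"--jars={SPARK_EXAMPLES_JAR}",
            "--", str(tasks)]  # arguments after "--" go to the job itself


def resize_cluster(cluster, region, num_workers):
    """Build the command that changes the number of workers."""
    return ["gcloud", "dataproc", "clusters", "update", cluster,
            f"--region={region}", f"--num-workers={num_workers}"]


def lab_commands(cluster=CLUSTER, region=REGION):
    """Return the three commands in the order the lab walks through them."""
    return [
        create_cluster(cluster, region),
        submit_spark_pi(cluster, region),
        resize_cluster(cluster, region, num_workers=4),
    ]


if __name__ == "__main__":
    for command in lab_commands():
        print(shlex.join(command))
```

Keeping each command as a list of arguments means it can be passed to `subprocess.run` unchanged once you are ready to execute it against a real project.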

What's inside

Syllabus

Dataproc: Qwik Start - Command Line

Good to know

Know what's good, what to watch for, and possible dealbreakers
Core audience is seasoned data professionals familiar with the Google Cloud Platform
Students are expected to have some programming knowledge
Provides hands-on practice with using the Google Cloud console

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Dataproc: Qwik Start - Command Line with these activities:
Find an experienced engineer or consultant to mentor you
Seek guidance from an experienced engineer or consultant who can provide you with personalized advice and support.
  • Identify potential mentors.
  • Reach out to them and ask for their guidance.
Read relevant documentation or a blog post
Read a blog post or documentation on Google Cloud Dataproc that covers cluster creation and management.
  • Find a relevant blog post or documentation.
  • Read the article.
Follow an online tutorial for setting up a Dataproc cluster
Follow an online tutorial to create a Dataproc cluster. Use either Google Cloud's documentation or a third-party tutorial.
  • Find an appropriate tutorial.
  • Follow the steps outlined in the tutorial.
Run a simple Apache Spark job
Watch a video tutorial, from one of the other resources provided, that covers creating and running Apache Spark jobs.
  • Watch the video tutorial.
  • Follow along with the tutorial.
Practice creating and modifying Dataproc clusters
Practice creating and modifying a Dataproc cluster in the Google Cloud console or using the gcloud command line.
  • Create a new Dataproc cluster.
  • Modify the number of workers in the cluster.
  • Delete the cluster.
Practice running Apache Spark jobs on a Dataproc cluster
Practice running Apache Spark jobs on a Dataproc cluster. This will help you gain hands-on experience with the platform.
  • Create a Dataproc cluster.
  • Submit an Apache Spark job to the cluster.
  • Monitor the job's progress.
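For the monitoring step above, gcloud can list a cluster's jobs, report a single job's state, or block until a job finishes. The following is a minimal sketch under stated assumptions: the job ID and region are hypothetical placeholders, and the commands are only built and printed rather than executed.

```python
import shlex

# Dataproc job states after which a job no longer changes.
TERMINAL_STATES = {"DONE", "ERROR", "CANCELLED"}


def list_jobs(cluster, region):
    """Build the command that lists jobs submitted to one cluster."""
    return ["gcloud", "dataproc", "jobs", "list",
            f"--cluster={cluster}", f"--region={region}"]


def job_state_command(job_id, region):
    """Build the command that prints only the job's current state."""
    return ["gcloud", "dataproc", "jobs", "describe", job_id,
            f"--region={region}", "--format=value(status.state)"]


def wait_for_job(job_id, region):
    """Build the command that streams driver output until the job ends."""
    return ["gcloud", "dataproc", "jobs", "wait", job_id,
            f"--region={region}"]


def is_finished(state):
    """Return True when the reported state is a terminal one."""
    return state.strip().upper() in TERMINAL_STATES


if __name__ == "__main__":
    # "example-job-id" and "us-central1" are hypothetical placeholders.
    print(shlex.join(list_jobs("example-cluster", "us-central1")))
    print(shlex.join(job_state_command("example-job-id", "us-central1")))
    print(shlex.join(wait_for_job("example-job-id", "us-central1")))
```

Polling with the state command and `is_finished` suits scripts that do other work in between checks; the wait command is simpler when you just want to follow one job to completion.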
Build a personal project using Google Cloud Dataproc
Create a project that leverages Google Cloud Dataproc to solidify your understanding of the platform and its capabilities.
  • Identify a project idea.
  • Design and implement your project.
  • Share your project with others.
Contribute to an open-source project related to Google Cloud Dataproc
Contributing to an open-source project related to Google Cloud Dataproc gives you a deeper understanding of the platform and its ecosystem while also giving back to the community.
  • Find an open-source project related to Google Cloud Dataproc.
  • Identify ways to contribute to the project.
  • Make your contributions and submit pull requests.

Career center

Learners who complete Dataproc: Qwik Start - Command Line will develop knowledge and skills that may be useful to these careers:
Data Engineer
A Data Engineer designs, builds, maintains, and analyzes data infrastructure to help an organization get value from its data.
Data Analyst
A Data Analyst analyzes data to identify trends and patterns in the data that may be used to improve decision-making within an organization.
Machine Learning Engineer
A Machine Learning Engineer designs, builds, deploys, and maintains machine learning applications and models.
Cloud Engineer
A Cloud Engineer designs, builds, deploys, and maintains cloud infrastructure and applications.
DevOps Engineer
A DevOps Engineer automates the process of software development, testing, and deployment.
Software Engineer
A Software Engineer designs, builds, and maintains software applications.
Data Scientist
A Data Scientist is an expert in data analysis and machine learning.
Database Administrator
A Database Administrator maintains and manages database systems.
Business Analyst
A Business Analyst analyzes business processes and develops solutions to improve efficiency and productivity.
Product Manager
A Product Manager is responsible for the development and management of a product or service.
Project Manager
A Project Manager plans, executes, and closes projects.
Technical Writer
A Technical Writer creates user documentation and other materials.
Technical Support Specialist
A Technical Support Specialist provides technical support to users.
Customer Success Manager
A Customer Success Manager is responsible for the success of a customer.
Sales Representative
A Sales Representative sells products or services.

Reading list

We've selected eight books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Dataproc: Qwik Start - Command Line.
Provides a comprehensive introduction to Apache Spark.

Similar courses

Here are nine courses similar to Dataproc: Qwik Start - Command Line, each rated most relevant:
  • Dataproc: Qwik Start - Console
  • APIs Explorer: Create and Update a Cluster
  • Introduction to Cloud Dataproc: Hadoop and Spark on...
  • Distributed Image Processing in Cloud Dataproc
  • Machine Learning with Spark on Google Cloud Dataproc
  • Architecting Big Data Solutions Using Google Dataproc
  • Cloud Composer: Qwik Start - Console
  • Cloud Composer: Qwik Start - Command Line
  • Building Realtime Pipelines in Cloud Data Fusion

© 2016 - 2024 OpenCourser