We may earn an affiliate commission when you visit our partners.
Google Cloud Training
This accelerated one-week course builds on earlier courses in the "Data Engineering on Google Cloud Platform" specialization. Through a series of video lectures, demonstrations, and hands-on labs, you will learn to create and manage compute clusters that run Hadoop, Spark, Pig, or Hive jobs on Google Cloud Platform. You will also learn to access various cloud storage options from your compute clusters and to integrate Google's machine learning capabilities into your analytics programs.

In the hands-on labs, you will create and manage Dataproc clusters with the web console and the CLI. You will then use the clusters to run Spark and Pig jobs. Next, you will create IPython notebooks that integrate with BigQuery and storage, and use Spark. Finally, you will integrate the machine learning APIs into your data analysis.

Prerequisites
  • Google Cloud Platform Big Data & Machine Learning Fundamentals (or equivalent experience)
  • Knowledge of Python

Good to know

Know what's good, what to watch for, and possible dealbreakers
Will improve your background in Hadoop, Spark, Pig, and Hive
Geared toward individuals with foundational to intermediate-level knowledge of Google Cloud Platform and Python
Can be a cornerstone for those interested in enhancing their data engineering capabilities with Google Cloud Platform technology
In-demand skills in data engineering and data science
Part of a larger specialization in data engineering on Google Cloud Platform, indicating a well-rounded approach
Taught by Google Cloud Training, who are experts in cloud technologies

Reviews summary

Meaningful learning with Dataproc in Spanish

This course covers processing unstructured data on Google Cloud Platform with Big Data tools such as Cloud Dataproc, Spark, Pig, and Hive. Students learn to create and manage processing clusters that run Hadoop jobs. It is a fast-paced, one-week lesson within a larger specialization on data engineering. Some students felt that half of the material aimed to persuade rather than teach, but overall students had a positive experience and found the course interesting and fun, with plenty of hands-on learning.

Students mention three main pain points. First, the course is in Spanish, which makes it inaccessible to learners who are not fluent in Spanish. Second, some labs repeat content, and some links redirect to the wrong pages or break the sequence, which can be frustrating. Third, the course is project-heavy, and students report spending significant time outside the course on projects.

Despite these shortcomings, students report learning a lot in a short time and found the hands-on experience invaluable. If you are comfortable taking a course entirely in Spanish and want to learn more about unstructured data processing on Google Cloud Platform, consider enrolling.
Fast-paced, one-week course
"This course is a fast-paced, one-week lesson within a larger specialization on data engineering."
Demos make course concepts easier to understand.
"Really cool course with fun little demos."
Lots of in-depth, hands-on learning experiences.
"In the hands-on labs, you'll create and manage Dataproc clusters with the web console and CLI."
"Then, you'll use the clusters to execute Spark and Pig jobs."
Course spends too much time promoting GCP.
"Half the material is spent convincing individuals on why GCP is a better option than what others currently use"
Coursework is project-heavy.
Repetitive or similar labs within the course
"I had repeated labs or very similar ones."
Broken links and wrong page redirects in course labs
"The links of the labs do not work correctly."
Course is entirely in Spanish.

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform en Español with these activities:
Create a Jupyter Notebook
Create a Jupyter notebook that integrates BigQuery and Cloud Storage to demonstrate the concepts covered in class.
  • Create a new Jupyter notebook.
  • Import the required libraries.
  • Read data from BigQuery.
  • Write data to Cloud Storage.
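The notebook steps above might look roughly like the following sketch. It assumes the `google-cloud-bigquery` and `google-cloud-storage` client libraries are installed and that credentials are already configured. The helper names (`build_query`, `rows_to_csv`, `export_table_sample`) and any table or bucket names are hypothetical placeholders, not part of the course material.

```python
import csv
import io


def build_query(table: str, limit: int = 100) -> str:
    """Build a simple SELECT over a fully qualified table name (hypothetical helper)."""
    return f"SELECT * FROM `{table}` LIMIT {int(limit)}"


def rows_to_csv(rows) -> str:
    """Serialize an iterable of dict-like rows to CSV text; header comes from the first row."""
    rows = [dict(r.items()) for r in rows]
    if not rows:
        return ""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0].keys()), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


def export_table_sample(table: str, bucket_name: str, blob_path: str, limit: int = 100) -> None:
    """Read rows from BigQuery and write them to Cloud Storage as a CSV object."""
    # Imported lazily so the helpers above work without the Google libraries installed.
    from google.cloud import bigquery, storage

    rows = bigquery.Client().query(build_query(table, limit)).result()
    data = rows_to_csv(rows)
    blob = storage.Client().bucket(bucket_name).blob(blob_path)
    blob.upload_from_string(data, content_type="text/csv")
```

In a notebook cell you would then call `export_table_sample(...)` with your own table, bucket, and object path. Buffering the rows in memory is only reasonable for small samples; larger exports would need a different approach.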
Follow Spark & Pig Tutorials
Follow guided tutorials on Spark and Pig to supplement the concepts covered in class.
  • Identify relevant tutorials online.
  • Follow the tutorials step-by-step.
  • Complete the exercises provided in the tutorials.
Start a Big Data Project
Start a project that involves working with Big Data to apply the concepts covered in class.
  • Identify a problem or opportunity that can be solved using Big Data.
  • Develop a project proposal.
  • Start working on the project.
Practice Spark & Pig Queries
Practice writing Spark and Pig queries to reinforce concepts covered in class.
  • Create a sample dataset.
  • Write Spark and Pig queries to analyze the dataset.
  • Compare the results of the queries.
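As one possible way to practice the steps above, the sketch below counts words in a tiny sample dataset with the Spark RDD API and with a pure-Python baseline, so the two results can be compared. The function names and the sample data are hypothetical, the Pig side is left out, and the Spark part assumes `pyspark` is installed and a `SparkSession` is available.

```python
from collections import Counter


def reference_word_count(lines):
    """Pure-Python baseline: count whitespace-separated, lowercased words."""
    return Counter(word for line in lines for word in line.lower().split())


def spark_word_count(spark, lines):
    """Same count with the Spark RDD API; `spark` is an existing SparkSession."""
    counts = (
        spark.sparkContext.parallelize(lines)
        .flatMap(lambda line: line.lower().split())
        .map(lambda word: (word, 1))
        .reduceByKey(lambda a, b: a + b)
    )
    return dict(counts.collect())


# Hypothetical sample dataset.
sample = ["spark and pig", "pig and hive", "spark"]
expected = reference_word_count(sample)

# With Spark available, the two results should agree:
# from pyspark.sql import SparkSession
# spark = SparkSession.builder.appName("wordcount-practice").getOrCreate()
# assert spark_word_count(spark, sample) == dict(expected)
```

Writing the equivalent query in Pig and checking it against the same baseline would complete the comparison.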
Mentor Students
Help other students in the class by providing guidance and support.
  • Identify students who may need help.
  • Offer your assistance.
  • Provide guidance and support.
Attend a Big Data Workshop
Attend a workshop on Big Data to learn about the latest trends and technologies in the field.
  • Identify relevant workshops online or in your area.
  • Register for the workshop.
  • Attend the workshop and actively participate.
Build a Data Pipeline
Build a complete data pipeline from data ingestion to data analysis to demonstrate the concepts covered in class.
  • Design the data pipeline architecture.
  • Implement the data pipeline using the appropriate tools and technologies.
  • Test and deploy the data pipeline.
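A minimal sketch of the ingestion, transformation, and analysis stages named above, written in plain Python so it can be tested locally before any cluster is involved. The record fields (`status`, `bytes`) and function names are assumptions made for illustration; on Dataproc each stage could instead be a Spark job reading from and writing to cloud storage.

```python
import json
from collections import defaultdict


def ingest(raw_lines):
    """Ingestion: parse JSON lines, skipping lines that fail to parse."""
    for line in raw_lines:
        try:
            yield json.loads(line)
        except json.JSONDecodeError:
            continue


def transform(records):
    """Transformation: keep records with a numeric 'bytes' field, normalize 'status' to a string."""
    for rec in records:
        if isinstance(rec, dict) and isinstance(rec.get("bytes"), (int, float)):
            yield {"status": str(rec.get("status", "unknown")), "bytes": rec["bytes"]}


def analyze(records):
    """Analysis: total bytes per status."""
    totals = defaultdict(int)
    for rec in records:
        totals[rec["status"]] += rec["bytes"]
    return dict(totals)


def run_pipeline(raw_lines):
    """Chain the three stages; generators keep memory use low."""
    return analyze(transform(ingest(raw_lines)))


# Hypothetical input, including one malformed line and one incomplete record.
raw = [
    '{"status": 200, "bytes": 512}',
    "not json",
    '{"status": 404, "bytes": 128}',
    '{"status": 200, "bytes": 256}',
    '{"status": 500}',
]
result = run_pipeline(raw)
```

Keeping each stage a separate function makes it straightforward to test the stages individually, which matches the "test and deploy" step.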

Career center

Learners who complete Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform en Español will develop knowledge and skills that may be useful to these careers:
Data Engineer
A Data Engineer is a professional who builds, maintains, and analyzes large data sets. They use a variety of tools and techniques to extract insights from data and help businesses make better decisions. This course provides a foundation in the tools and techniques that are used by Data Engineers, which can be helpful for those who want to enter or advance in this field.
Data Scientist
Data Scientists use their expertise in math, statistics, and computer science to extract insights from data. They work with businesses to solve problems and make better decisions. This course provides a foundation in the tools and techniques that are used by Data Scientists, which can be helpful for those who want to enter or advance in this field.
Big Data Analyst
Big Data Analysts are responsible for collecting, cleaning, and analyzing large data sets. They use their findings to help businesses make better decisions. This course provides a foundation in the tools and techniques that are used by Big Data Analysts, which can be helpful for those who want to enter or advance in this field.
Machine Learning Engineer
Machine Learning Engineers develop and deploy machine learning models. They work with businesses to solve problems and make better decisions. This course provides a foundation in the tools and techniques that are used by Machine Learning Engineers, which can be helpful for those who want to enter or advance in this field.
Cloud Engineer
Cloud Engineers are responsible for designing, building, and maintaining cloud-based solutions. They work with businesses to help them move their applications and data to the cloud. This course provides a foundation in the tools and techniques that are used by Cloud Engineers, which can be helpful for those who want to enter or advance in this field.
Data Architect
Data Architects design and build data management systems. They work with businesses to help them manage their data effectively. This course provides a foundation in the tools and techniques that are used by Data Architects, which can be helpful for those who want to enter or advance in this field.
Database Administrator
Database Administrators are responsible for managing and maintaining databases. They work with businesses to ensure that their data is secure and accessible. This course provides a foundation in the tools and techniques that are used by Database Administrators, which can be helpful for those who want to enter or advance in this field.
Data Analyst
Data Analysts are responsible for collecting, cleaning, and analyzing data. They use their findings to help businesses make better decisions. This course provides a foundation in the tools and techniques that are used by Data Analysts, which can be helpful for those who want to enter or advance in this field.
Business Analyst
Business Analysts work with businesses to identify and solve problems. They use their knowledge of data and business processes to help businesses make better decisions. This course provides a foundation in the tools and techniques that are used by Business Analysts, which can be helpful for those who want to enter or advance in this field.
Software Engineer
Software Engineers design, develop, and test software. They work with businesses to create software that meets their needs. This course provides a foundation in the tools and techniques that are used by Software Engineers, which can be helpful for those who want to enter or advance in this field.
Computer Scientist
Computer Scientists study the theory and practice of computation. They work with businesses to develop new technologies and solve problems. This course provides a foundation in the tools and techniques that are used by Computer Scientists, which can be helpful for those who want to enter or advance in this field.
Statistician
Statisticians collect, analyze, and interpret data. They work with businesses to help them make better decisions. This course provides a foundation in the tools and techniques that are used by Statisticians, which can be helpful for those who want to enter or advance in this field.
Operations Research Analyst
Operations Research Analysts use mathematical and analytical techniques to solve problems. They work with businesses to help them make better decisions. This course provides a foundation in the tools and techniques that are used by Operations Research Analysts, which can be helpful for those who want to enter or advance in this field.
Financial Analyst
Financial Analysts use their knowledge of finance and economics to help businesses make better decisions. This course provides a foundation in the tools and techniques that are used by Financial Analysts, which can be helpful for those who want to enter or advance in this field.
Project Manager
Project Managers lead and manage projects. They work with businesses to help them achieve their goals. This course provides a foundation in the tools and techniques that are used by Project Managers, which can be helpful for those who want to enter or advance in this field.

Similar courses

Here are nine courses similar to Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform en Español.
Herramientas para la ciencia de datos
ML Pipelines on Google Cloud en Español
Smart Analytics, Machine Learning, and AI on GCP en...
Serverless Machine Learning con TensorFlow en GCP
Google Cloud Product Fundamentals en Español
Google Docs en Español
Creatividad en la bandeja de entrada: marketing por...
Machine Learning in the Enterprise - Español
Introducción a R para ciencia de datos

© 2016 - 2024 OpenCourser