We may earn an affiliate commission when you visit our partners.
Course image
Nestor Nicolas Campos Rojas

En este proyecto de 1 hora, aprenderás a aplicar buenas prácticas bajo el contexto de procesamiento Big Data, utilizando una de las plataformas más importantes en la actualidad, Databricks.

Además, podrás analizar las mejores opciones y librerías para la manipulación de datos sobre dataframes de Spark.

Enroll now

What's inside

Syllabus

Análisis de documentos con servicios cognitivos de Azure
Al final de este proyecto, tú entenderás y aplicarás las mejores prácticas para procesar datos de forma masiva en un ambiente de Big Data, específicamente con la tecnología Databricks y Spark.

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Se centra en prácticas recomendadas para el procesamiento de Big Data, específicamente con Databricks y Spark, que son tecnologías muy utilizadas en la industria
Desarrolla habilidades en el manejo de conjuntos de datos de Spark, que son esenciales para el análisis de Big Data
El instructor, Nestor Nicolas Campos Rojas, no es muy conocido
El proyecto tiene una duración de 1 hora, lo cual puede ser una limitación para cubrir completamente el tema
No menciona explícitamente la relevancia de las habilidades aprendidas en el mundo laboral o académico

Save this course

Save Mejores prácticas para el procesamiento de datos en Big Data to your list so you can find it easily later:
Save

Reviews summary

Big data practices project

This course has been reviewed very positively by its participants. Responses indicate that although the course would be improved with the inclusion of some introductory material, the overall quality of the course is high. The host instructor is also praised for their teaching style.
Good instructor.
"The instructor is very good explaining."
"Muchas gracias por tan buen proyecto."
High quality instruction.
"The course is very good..."
"The instructor is very good explaining."
Could be improved with a more detailed explanation of Databricks.
"He pagado el curso y no he sido capaz de registrame en Databricks según las indicaciones del curso ya que no lo explican, he tenido que poner datos bancarios para registrarme y hacer un montón de pasos que dan por obvios para registrarse teniendo que pasar luego una odisea para cancelar las subscripciones."
Could be improved with more introductory material.
"Sólo sería de gran utilidad un capítulo introductorio donde se expliquen la forma de instalar las tecnologías empleadas en Databricks pero en nuestras computadoras."

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Mejores prácticas para el procesamiento de datos en Big Data with these activities:
Set Up Your Databricks Workspace
Ensure you have a properly configured workspace to maximize your productivity.
Show steps
  • Create a Databricks account
  • Create a Databricks cluster
  • Install Databricks CLI
Data Wrangling with Spark DataFrames
Develop proficiency in transforming and manipulating data using Spark DataFrames.
Browse courses on Data Wrangling
Show steps
  • Import data into a DataFrame
  • Perform data transformations
  • Handle missing values
  • Aggregate and group data
Analyze Data with Azure Cognitive Services
Enhance your analytical capabilities by leveraging Azure Cognitive Services for tasks like image recognition and text processing.
Browse courses on Azure Cognitive Services
Show steps
  • Set up Azure Cognitive Services
  • Integrate Cognitive Services into Spark
  • Use Cognitive Services for image analysis
  • Use Cognitive Services for text analysis
Four other activities
Expand to see all activities and additional details
Show all seven activities
Develop a Big Data Processing Pipeline
Design and implement a comprehensive pipeline to process large datasets efficiently and effectively.
Browse courses on Big Data Processing
Show steps
  • Define the pipeline architecture
  • Implement the pipeline using Spark
  • Deploy and monitor the pipeline
Participate in a Databricks Hackathon
Showcase your skills and collaborate with others to solve real-world Big Data challenges.
Browse courses on Databricks
Show steps
  • Find a hackathon
  • Form a team
  • Develop a solution
  • Submit your solution
Mentor Junior Data Engineers
Share your knowledge and experience with aspiring data engineers to foster professional growth within the field.
Browse courses on Mentoring
Show steps
  • Join a mentorship program
  • Connect with mentees
  • Provide guidance and support
Engage with Experienced Big Data Professionals
Connect with industry experts to gain valuable insights and expand your professional network.
Browse courses on Mentorship
Show steps
  • Attend industry events
  • Join online communities
  • Reach out to individuals directly

Career center

Learners who complete Mejores prácticas para el procesamiento de datos en Big Data will develop knowledge and skills that may be useful to these careers:
Data Scientist
A Data Scientist uses advanced statistical techniques to solve problems. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often used in data science. This course may be particularly useful for data scientists who work with large datasets.
Business Analyst
A Business Analyst uses data to solve business problems. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often used in business analysis. This course may be particularly useful for business analysts who work with large datasets.
Data Architect
A Data Architect designs and builds data architectures. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often used in data architecture. This course may be particularly useful for data architects who work with large datasets.
Software Engineer
A Software Engineer designs and develops software applications. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often used in software development. This course may be particularly useful for software engineers who work with large datasets.
Data Analyst
A Data Analyst cleans and analyzes data to find patterns and trends. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often used in data analysis. This course may be particularly useful for data analysts who work with large datasets.
Database Administrator
A Database Administrator manages and maintains databases. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often stored in databases. This course may be particularly useful for database administrators who work with large databases.
Financial Analyst
A Financial Analyst uses data to help businesses make financial decisions. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often used in financial analysis. This course may be particularly useful for financial analysts who work with large datasets.
Marketing Analyst
A Marketing Analyst uses data to help businesses make marketing decisions. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often used in marketing analysis. This course may be particularly useful for marketing analysts who work with large datasets.
Risk Analyst
A Risk Analyst uses data to help businesses identify and manage risks. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often used in risk analysis. This course may be particularly useful for risk analysts who work with large datasets.
Data Engineer
A Data Engineer designs and builds data pipelines. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often used in data engineering. This course may be particularly useful for data engineers who work with large datasets.
Cloud Architect
A Cloud Architect designs and builds cloud computing solutions. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often stored in the cloud. This course may be particularly useful for cloud architects who work with large datasets.
Government Analyst
A Government Analyst uses data to help government agencies make decisions. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often used in government analysis. This course may be particularly useful for government analysts who work with large datasets.
Healthcare Analyst
A Healthcare Analyst uses data to help healthcare providers make decisions. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data, which is often used in healthcare analysis. This course may be particularly useful for healthcare analysts who work with large datasets.
Data Management Specialist
A Data Management Specialist manages and maintains data. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data. This course may be particularly useful for data management specialists who work with large datasets.
Big Data Developer
A Big Data Developer designs and develops big data solutions. The Mejores prácticas para el procesamiento de datos en Big Data course can help someone in this field by providing them with the skills to process big data. This course may be particularly useful for big data developers who work with large datasets.

Reading list

We've selected nine books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Mejores prácticas para el procesamiento de datos en Big Data.
Esta es una guía completa de Apache Spark, escrita por sus creadores. Brinda información detallada sobre la arquitectura, las API y las mejores prácticas de Spark, lo que lo convierte en una lectura invaluable para usuarios avanzados que buscan una comprensión profunda.
Esta es una referencia integral de Apache Spark que abarca todos sus componentes. Proporciona una comprensión profunda de la arquitectura, los algoritmos y las extensiones de Spark, lo que lo hace ideal para usuarios experimentados que buscan un recurso extenso.
Is written by one of the originators of Apache Spark and offers a comprehensive view of the framework, including foundational concepts, architecture, and various use cases.
Can help readers develop an understanding of advanced Spark techniques that enable the manipulation of big data in optimized ways. It is written by developers of Apache Spark and comprehensive guide for professionals.
Este libro explora técnicas avanzadas de análisis de datos utilizando Spark. Cubre temas como aprendizaje profundo, análisis de gráficos y procesamiento de lenguaje natural, brindando información valiosa para aquellos que buscan ir más allá de las tareas básicas de análisis.
Este libro brinda una introducción integral a Apache Spark, que abarca desde los conceptos básicos hasta los casos de uso avanzados. Proporciona una base sólida para comprender el procesamiento de datos a gran escala y el uso de Spark para resolver problemas de Big Data.
Dives into Apache Spark's architecture, deployment, and development of scalable applications. It will be most useful for readers who are already familiar with big data concepts.
Will be useful to those who are interested in learning about Apache Spark, including its various components such as Spark Core, Spark SQL, Spark Streaming, and more.
Si bien Hadoop no es el tema principal de este curso, este libro proporciona información valiosa sobre el marco subyacente que sustenta Apache Spark. Ofrece una base sólida para comprender el ecosistema de Big Data.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Mejores prácticas para el procesamiento de datos en Big Data.
Fundamentos TIC para profesionales de negocios:...
Psicología de la salud
Pensamiento crítico: toma de decisiones razonadas
Pensamiento crítico: toma de decisiones razonadas
Fundamentos de Comunicaciones Ópticas
Introducción a la gestión ágil de proyectos
Trabajo en equipo en el ámbito jurídico
La España del Quijote
Fotografía en Latinoamérica: historia, imágenes y espacios
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser