We may earn an affiliate commission when you visit our partners.
Pluralsight

Working with Azure Databricks Programmatically

Kishan Iyer

To carry out large-scale operations on Databricks, you'll need to develop apps or scripts which can interact with this big data service. This course looks into the Databricks CLI, its REST API, and the dbutils library to automate such interactions.

While nearly every task in Databricks can be accomplished from its Web UI, performing operations of any complexity at scale requires interacting with the service from a script or an application. In this course, Working with Azure Databricks Programmatically, you'll learn how to automate these interactions. First, you'll explore the Databricks command-line interface (CLI): how to create and use a personal access token for authentication, and how to construct CLI commands to perform a variety of workspace operations. Then, you'll explore the use of the Databricks REST API. Finally, you'll discover the versatility of the dbutils library for interacting with the Databricks file system from a Python application. Once you complete this course, you'll have a clear understanding of how interactions with a Databricks service can be automated using the Databricks CLI, the REST API, and the dbutils package.
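
As a rough illustration of the token-based REST access described above, the following Python sketch builds (but does not send) an authenticated request to the clusters list endpoint. The helper name build_clusters_list_request is hypothetical and not taken from the course, and the workspace URL and token are placeholders supplied by the caller.

```python
import urllib.request


def build_clusters_list_request(workspace_url: str, token: str) -> urllib.request.Request:
    """Build a GET request for the Databricks clusters list endpoint.

    Hypothetical helper for illustration only; workspace_url and token
    are supplied by the caller (for example, a personal access token).
    """
    # Strip any trailing slash so the path joins cleanly.
    url = workspace_url.rstrip("/") + "/api/2.0/clusters/list"
    # Databricks REST calls authenticate with a bearer token header.
    headers = {"Authorization": f"Bearer {token}"}
    return urllib.request.Request(url, headers=headers, method="GET")
```

To send the request, pass it to urllib.request.urlopen and parse the JSON response body. In practice the token would usually be read from an environment variable or a secret store rather than hard-coded.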

What's inside

Syllabus

Course Overview
Accessing Azure Databricks with the CLI
Using the Azure Databricks REST API
Managing an Azure Databricks Workspace with dbutils

Good to know

Know what's good, what to watch for, and possible dealbreakers
Taught by Kishan Iyer, who is recognized for work in Azure Databricks
This course is designed for individuals who have experience with Azure Databricks and want to learn how to automate interactions with this service using the CLI, REST API, and dbutils library
Helps learners create and use a personal access token for authentication and construct Databricks command-line interface (CLI) commands to perform a variety of workspace operations
Develops understanding of how to use the Azure Databricks REST API and the dbutils library to interact with the Databricks file system from a Python application
Suitable for students with a good understanding of Azure Databricks and Python programming
Additional resources may be required for a deeper understanding of the concepts covered in this course
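
The dbutils-based file system access mentioned in the points above might look something like the sketch below. It assumes the code runs where a dbutils object is available (such as a Databricks notebook) and passes that object in explicitly. The function name and the example path are illustrative assumptions, not material from the course.

```python
from typing import Any, List


def list_dbfs_paths(dbutils: Any, directory: str) -> List[str]:
    """Return the paths of the entries directly under a DBFS directory.

    Hypothetical helper: `dbutils` is the object Databricks provides in
    notebooks; it is passed in so the function has no hidden globals.
    """
    # dbutils.fs.ls returns one entry per file or folder, each with a .path.
    return [entry.path for entry in dbutils.fs.ls(directory)]
```

In a notebook, a call such as list_dbfs_paths(dbutils, "dbfs:/FileStore") would return the entries under that folder, provided it exists in the workspace.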

Career center

Learners who complete Working with Azure Databricks Programmatically will develop knowledge and skills that may be useful to these careers:
Database Engineer
A Database Engineer helps maximize the performance of an organization's database. A significant part of this involves data integration, which is critical for the effective use of data in data science and analytics. The REST API skill taught in this course directly applies to data integration. A background in Azure Databricks would thus be particularly relevant to someone in this role, especially for working with large datasets.
Software Engineer
A Software Engineer designs, develops, deploys, and maintains software applications. Those involved in big data are responsible for working with large datasets, which requires expertise in parallel processing and distributed systems. These concepts are covered in this course, making it helpful for Software Engineers entering the field of big data processing.
Data Engineer
Data Engineers design, construct, and maintain the infrastructure used to store and process data. They are responsible for data modeling, data integration, and data security. The REST API skill taught in this course directly applies to data integration. A background in Azure Databricks would thus be particularly relevant to someone in this role, especially for working with large datasets.
Data Architect
A Data Architect designs and manages an organization's data architecture, which involves managing data models, data quality, and data security. As data models are used to represent complex systems, this role requires an understanding of software engineering. The skills taught in this course, particularly working with the Databricks REST API, can help a Data Architect meet the demands of designing a data architecture for big data systems.
Machine Learning Engineer
Machine Learning Engineers build, deploy, and maintain machine learning models. As many machine learning applications involve working with very large datasets, distributed computing is essential. This course covers the concepts of parallel processing and distributed systems, and can be beneficial for Machine Learning Engineers entering the field of big data processing.
Artificial Intelligence Engineer
Artificial Intelligence Engineers design, develop, and maintain artificial intelligence systems. These can involve working with very large datasets. This course covers the concepts of parallel processing and distributed systems, and can be beneficial for Artificial Intelligence Engineers entering the field of big data processing.
Business Intelligence Analyst
Business Intelligence Analysts translate raw data into actionable insights for businesses. This typically involves working with data from different sources, which requires proficiency in data integration. The REST API skill taught in this course directly applies to data integration.
Data Scientist
Data Scientists apply scientific methods to extract knowledge from data. Data is often stored in a variety of formats and locations, making data integration a critical part of the job. This course teaches the Databricks REST API, which can be useful for integrating data.
Software Development Manager
A Software Development Manager leads a team of software engineers. They are responsible for planning, organizing, and directing development efforts. A strong understanding of software development best practices is essential for a Software Development Manager. This course can help someone in this role better understand the challenges and opportunities of big data processing, enabling them to make better decisions for their team.
Data Analyst
A Data Analyst collects and analyzes data to identify trends and patterns. Much of this data is stored in databases. This course teaches the Databricks CLI, which can be useful for managing data held in a Databricks workspace.
IT Project Manager
An IT Project Manager plans, executes, and closes IT projects. They are responsible for ensuring that projects are completed on time, within budget, and to the required quality standards. A strong understanding of software development best practices is essential for an IT Project Manager. This course can help someone in this role better understand the challenges and opportunities of big data processing, enabling them to make better decisions for their team.
DevOps Engineer
DevOps Engineers are responsible for bridging the gap between development and operations teams. They work to improve communication and collaboration, and to ensure that software is deployed and maintained efficiently. A strong understanding of software development best practices is essential for a DevOps Engineer. This course can help someone in this role better understand the challenges and opportunities of big data processing, enabling them to make better decisions for their team.
Database Administrator
Database Administrators are responsible for the installation, configuration, maintenance, and performance of database systems. They ensure that databases are available, reliable, and secure. This course teaches the Databricks CLI, which can be useful for managing data held in a Databricks workspace.
Product Manager
Product Managers are responsible for the conception, development, and marketing of products. They work with engineers, designers, and marketers to ensure that products meet the needs of customers. A strong understanding of software development best practices is essential for a Product Manager. This course can help someone in this role better understand the challenges and opportunities of big data processing, enabling them to make better decisions for their team.
Technical Architect
Technical Architects design and implement the technical infrastructure of an organization. They work with business leaders to understand their needs and to develop solutions that meet those needs. A strong understanding of software development best practices is essential for a Technical Architect. This course can help someone in this role better understand the challenges and opportunities of big data processing, enabling them to make better decisions for their team.

Reading list

We've selected six books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Working with Azure Databricks Programmatically.
Can serve as a basic primer for those new to Spark or big data.
For those working with Python, this book serves as a useful primer and reference for the Pandas library
May be useful for those interested in the intersection of machine learning and data analysis with Python.
May be helpful for those looking for a more high-level view of data science.

Similar courses

Here are nine courses similar to Working with Azure Databricks Programmatically.
Administering Clusters and Configuring Policies with... (Most relevant)
Managing and Administering the Databricks Service (Most relevant)
Developing on AWS (Most relevant)
Automating Cisco ASA and Firepower Policies Using APIs (Most relevant)
Integrating SQL and ETL Tools with Databricks
Java EE 7: Getting Started
Data Engineering using Databricks on AWS and Azure
Getting Started with Apache Spark on Databricks
Working with Data in PowerShell

© 2016 - 2024 OpenCourser