Implement a Data Analytics Solution with Azure Databricks

Codice

DP-3011

Durata

1 Giorno

Prezzo

670,00 € (iva escl.)

Lingua

Italiano

Modalità

Virtual Classroom
Corso in aula

Schedulazione

Luogo	Data	Iscrizione
A Richiesta

Prerequisiti

Before starting this learning path, you should already be comfortable with the fundamentals of Python and SQL.

This includes being able to write simple Python scripts and work with common data structures, as well as writing SQL queries to filter, join, and aggregate data.

A basic understanding of common file formats such as CSV, JSON, or Parquet will also help when working with datasets.

In addition, familiarity with the Azure portal and core services like Azure Storage is important, along with a general awareness of data concepts such as batch versus streaming processing and structured versus unstructured data.

While not mandatory, prior exposure to big data frameworks like Spark, and experience working with Jupyter notebooks, can make the transition to Databricks smoother.

Obiettivi

By the end of this learning path, you'll have built solid intermediate to advanced skills in both Databricks and Spark on Azure.

You're able to ingest, transform, and analyze large-scale datasets using Spark DataFrames, Spark SQL, and PySpark, giving you confidence in working with distributed data processing.

Within Databricks, you know how to navigate the workspace, manage clusters, and build and maintain Delta tables.

You'll also be capable of designing and running ETL pipelines, optimizing Delta tables, managing schema changes, and applying data quality rules. In addition, you learn how to orchestrate workloads with Lakeflow Jobs and pipelines, enabling you to move from exploration to automated workflows.

Finally, you gain familiarity with governance and security features, including Unity Catalog, Purview integration, and access management, preparing you to operate effectively in production-ready data environments.

Contenuti

Explore Azure Databricks

Perform data analysis with Azure Databricks

Use Apache Spark in Azure Databricks

Manage data with Delta Lake

Build Lakeflow Declarative Pipelines

Deploy workloads with Lakeflow Jobs