Source Code

Highlight

Azure Databricks is a fast, easy-to-use, and scalable big data collaboration platform. Because it is based on Apache Spark, it brings Spark's high performance and other benefits without requiring deep technical knowledge. You write Python or Scala scripts and you are ready to go.

Intro

In this video I will cover the basics of Databricks and show a common transformation scenario: Blob Storage JSON to Blob Storage CSV.

Code samples: https://github.com/MarczakIO/azure4everyone-samples/tree/master/azure-databricks-introduction
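
The linked repository holds the actual samples from the video. For readers who want a feel for the JSON to CSV scenario before watching, here is a minimal Python sketch of what such a notebook could look like. It is not the code from the video: the helper names (`blob_url`, `account_key_conf`, `json_to_csv`) and the container, account, and file names are made up for illustration, and it assumes the storage account is accessed with an account key over the `wasbs://` scheme.

```python
def blob_url(container, account, path):
    """Build a wasbs:// URL for a blob path (hypothetical helper)."""
    return f"wasbs://{container}@{account}.blob.core.windows.net/{path}"


def account_key_conf(account):
    """Name of the Spark setting that holds the storage account key."""
    return f"fs.azure.account.key.{account}.blob.core.windows.net"


def json_to_csv(spark, source_url, target_url):
    """Read JSON from source_url and write it back out as CSV.

    `spark` is the SparkSession that a Databricks notebook provides.
    By default spark.read.json expects one JSON object per line; for a
    single multi-line JSON document, add .option("multiLine", "true").
    """
    df = spark.read.json(source_url)
    # Spark writes a folder of part files at target_url, not a single file.
    df.write.mode("overwrite").option("header", "true").csv(target_url)


# Usage inside a notebook (all names below are placeholders):
# spark.conf.set(account_key_conf("mystorageaccount"), "<storage-account-key>")
# json_to_csv(
#     spark,
#     blob_url("input", "mystorageaccount", "data.json"),
#     blob_url("output", "mystorageaccount", "data-csv"),
# )
```

Keeping the URL and setting names in small helpers makes it easy to point the same notebook at a different container or account. In a real workspace the account key would normally come from a secret scope rather than being pasted into the notebook.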

Agenda

Today we will cover:

  1. Azure Databricks and Databricks platform overview
  2. Key features of Databricks
  3. Demo of Blob ingestion using a Python and Spark SQL script, with data visualisation
  4. Demo of Blob-to-Blob transformation using Scala and Spark SQL

Video

Final thoughts

Azure Databricks is one of those hot topics right now. This introduction is the second in the series on data transformation in Azure. Stay tuned to see more.


Adam Marczak

I've spent most of my career working with software and cloud technologies, but at heart I'm simply someone who loves learning new things and sharing what I discover. Through this blog and my Azure 4 Everyone YouTube channel, I try to make Azure and cloud computing more approachable for developers, architects, and anyone curious about technology.
