Data Lake Storage Gen 2 is the best storage solution for big data analytics in Azure. With its Hadoop compatible access, it is a perfect fit for existing platforms like Databricks, Cloudera, Hortonworks, Hadoop, HDInsight and many more. Take advantage of both blob storage and data lake in one service!

Intro

In this episode I give you introduction to what Azure Data Lake Storage is, how it works and how can you leverage it in your big data workloads. I will also explain the differences between Blob and ADLS.

Agenda

In a short demo I will show you

  • What is Data Lake Storage and how it works and why is it called Gen2?
  • What does it mean being designed for big data analytical workloads?
  • How does multi-protocol access work?
  • What are key differences between ADLS and Blob Storage?
  • Quick demo of creating ADLS in portal
  • Quick demo of connecting from Power BI and using multi-protocol access
  • How to use storage explorer with ADLS
  • How do Access Control Lists work and how to manage them
  • Demo with Databricks and ADLS

Sample code from demo: https://pastebin.com/ee7ULpwx

Video

Next steps for you after watching the video

  1. Azure Data Lake Storage documentation
  1. Transform data using Databricks and ADLS demo tutorial
  1. More on multi-protocol access
  1. Read more on ACL

Adam Marczak

I've spent most of my career working with software and cloud technologies, but at heart I'm simply someone who loves learning new things and sharing what I discover. Through this blog and my Azure 4 Everyone YouTube channel, I try to make Azure and cloud computing more approachable for developers, architects, and anyone curious about technology.

Did you enjoy the article?

Support me

Join as member

Share it

More tagged posts