Thursday, December 26, 2019

Data Platform Tips 29 - Integrate Azure Data Lake Storage Gen 2 with Power BI


Data scientists and analysts can integrate Azure Data Lake Storage Gen 2 with Power BI to analyse the data stored in the data lake. The steps below load a sample file from ADLS Gen 2 into Power BI Desktop and build a simple chart from it.


a) Use Azure Storage Explorer to upload the sample drivers.txt file to a filesystem in the Azure Data Lake Storage Gen 2 account.

b) Copy the primary access key and the DFS endpoint by right-clicking the ADLS Gen 2 storage account in Azure Storage Explorer.

c) Open Power BI Desktop, click "Get Data", and select Azure > Data Lake Storage Gen 2.

d) Append "adlsfilesystemdemo/drivers.txt" to the DFS endpoint copied in step b), where adlsfilesystemdemo is the filesystem that was created in the Azure Data Lake Storage Gen 2 account.
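
As a rough illustration of how the URL in step d) is put together, here is a minimal Python sketch. It assumes the DFS endpoint has the usual https://<account>.dfs.core.windows.net/ form. The account name "mystorageaccount" and the helper build_adls_url are made up for this example and are not part of Power BI or any Azure SDK.

```python
from urllib.parse import urlsplit


def build_adls_url(dfs_endpoint: str, filesystem: str, path: str) -> str:
    """Join a DFS endpoint, a filesystem name, and a file path into one URL."""
    parts = urlsplit(dfs_endpoint)
    if parts.scheme != "https" or not parts.netloc:
        raise ValueError(f"not an https endpoint: {dfs_endpoint!r}")
    # Strip stray slashes so the pieces are joined with exactly one "/" each.
    return "/".join([
        f"https://{parts.netloc}",
        filesystem.strip("/"),
        path.strip("/"),
    ])


# "mystorageaccount" is a placeholder account name, not one from this post.
url = build_adls_url(
    "https://mystorageaccount.dfs.core.windows.net/",
    "adlsfilesystemdemo",
    "drivers.txt",
)
print(url)
```

The printed value is the kind of URL that goes into the Power BI connection dialog. Whether the endpoint was copied with or without a trailing slash does not matter here.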

e) Provide the access key copied in step b) so that Power BI can read the file from Azure Data Lake Storage Gen 2, then click "Connect".

f) Once connected to the ADLS Gen 2 storage, you will see the drivers.txt file listed.

g) Click on "Combine" and select "Combine & Transform Data".

h) Preview the contents of the file and click "OK".

i) You can now rename "Query 1" to "Drivers" and give the columns appropriate names. Also hide "Column 5" from Report View.
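
For readers who want to see the same clean-up outside Power BI, here is a small Python sketch of step i). The contents of drivers.txt are not shown in this post, so the sample rows, the comma delimiter, and the four column names are all assumptions made for the example. Dropping the fifth column is only an analogy: in Power BI, hiding a column keeps it in the model.

```python
import csv
import io

# Made-up sample rows; the real drivers.txt contents are not shown in this post.
# Each row ends with an empty fifth field, standing in for "Column 5".
SAMPLE = "1,Alice,Smith,UK,\n2,Bob,Jones,USA,\n"

# Hypothetical names for the first four columns; the fifth is left out.
COLUMN_NAMES = ["DriverId", "FirstName", "LastName", "Country"]


def load_drivers(text: str) -> list[dict[str, str]]:
    """Parse headerless comma-delimited text into dicts keyed by COLUMN_NAMES."""
    rows = []
    for record in csv.reader(io.StringIO(text)):
        # zip stops at the shorter sequence, so the fifth field is dropped.
        rows.append(dict(zip(COLUMN_NAMES, record)))
    return rows


drivers = load_drivers(SAMPLE)
print(drivers[0])
```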

j) Finally, you can create a bar chart showing the number of drivers from each country using the data from ADLS Gen 2.
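
The aggregation behind the bar chart in step j) is a count of rows per country. The Python sketch below shows that idea with a text-only chart. The rows are invented for illustration, and the "Country" column name is an assumption, since the real file's columns are not listed in this post.

```python
from collections import Counter

# Made-up rows standing in for the Drivers query; not the real file contents.
drivers = [
    {"Name": "Alice", "Country": "UK"},
    {"Name": "Bob", "Country": "USA"},
    {"Name": "Carol", "Country": "UK"},
]


def drivers_per_country(rows):
    """Count how many rows fall under each value of the "Country" column."""
    return Counter(row["Country"] for row in rows)


counts = drivers_per_country(drivers)

# Print one text bar per country, largest count first.
for country, n in counts.most_common():
    print(f"{country:<5} {'#' * n} {n}")
```

In Power BI the same result comes from putting the country column on the axis and a count of drivers on the values of the bar chart.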

This is how you can connect Power BI Desktop to Azure Data Lake Storage Gen 2 and analyse the data stored there.
