How do I connect my Spark based HDInsight cluster to my blob storage

0 votes
I have created a blob storage earlier and HDInsight cluster earlier. Now I have requirements to connect and access blob storage from the HDinsight cluster. I haven’t done it before and I am not getting any tutorial which could help  to do that.

I have just created a Spark based HDInsight cluster. I have selected a blob storage that I created before, while creating the cluster. However, I have no idea how to access that blob storage from within the VM created there. I have read many different tutorials, but couldn't get a proper answer.

Can I add and access blob storage just like HDFS?
Apr 15, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,922 views

1 answer to this question.

0 votes
Go through this blog:

https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-hadoop-use-blob-storage#access-blobs

I went through this official HDInsight Hadoop blog where I found how to access blobs in it.  It provides commands for using PowerShell to access data stored in blobs.

To know more I would recommend you to browse through this Github link:

https://github.com/Blackmist/hdinsight-tools
answered Apr 15, 2018 by Shubham
• 13,490 points

Related Questions In Big Data Hadoop

0 votes
1 answer

How do I compile my java program on Ubuntu such that it will refer to hadoop-2.2.0 libraries?

The simplest solution for Linux machines would ...READ MORE

answered Oct 29, 2018 in Big Data Hadoop by Frankie
• 9,830 points
877 views
0 votes
1 answer

How do I get connected to Hadoop and Geo Spatial connector?

There are a number of free and ...READ MORE

answered Aug 14, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,665 views
0 votes
1 answer

I want to install snappy on Hadoop 1.2.1. How do I do that?

As per Cloudera, if you install hadoop ...READ MORE

answered Dec 11, 2018 in Big Data Hadoop by Frankie
• 9,830 points
816 views
0 votes
1 answer

How to checkout Hadoop 2.6.0 from git

Clone the following Git repository: git clone git ...READ MORE

answered Apr 23, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
608 views
+15 votes
2 answers

Git management technique when there are multiple customers and need multiple customization?

Consider this - In 'extended' Git-Flow, (Git-Multi-Flow, ...READ MORE

answered Mar 27, 2018 in DevOps & Agile by DragonLord999
• 8,450 points
3,460 views
0 votes
1 answer

How can I use my host machine’s web browser to check my HDFS services running in the VM?

The sole purpose of the virtual machine ...READ MORE

answered Apr 18, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,068 views
0 votes
3 answers

How to connect Spark to a remote Hive server?

JDBC is not required here. Create a hive ...READ MORE

answered Mar 8, 2019 in Big Data Hadoop by Vijay Dixon
• 190 points
12,097 views
webinar REGISTER FOR FREE WEBINAR X
REGISTER NOW
webinar_success Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP