How do I connect my Spark-based HDInsight cluster to my blob storage?

0 votes
I created a blob storage account and an HDInsight cluster earlier. Now I need to connect to and access the blob storage from the HDInsight cluster. I haven't done this before, and I can't find a tutorial that explains how.

I have just created a Spark-based HDInsight cluster, and while creating it I selected the blob storage that I had created before. However, I have no idea how to access that blob storage from within the VM created there. I have read many different tutorials but couldn't find a proper answer.

Can I add and access blob storage just like HDFS?
Apr 15, 2018 in Big Data Hadoop by kurt_cobain
• 9,240 points
575 views

1 answer to this question.

0 votes
Go through this documentation page:

https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-hadoop-use-blob-storage#access-blobs

I went through this official HDInsight Hadoop page, where I found how to access blobs from the cluster. It provides PowerShell commands for accessing data stored in blobs.
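To answer the "just like HDFS" part: HDInsight exposes Azure Blob storage through the Hadoop file system layer using the `wasb://` (or `wasbs://` over TLS) URI scheme, so a blob path can generally be used wherever an HDFS path would go. Below is a minimal Python sketch of how such a URI is put together and passed to Spark. The helper names `build_wasb_uri` and `read_text_from_blob`, and the container, account, and file names, are hypothetical placeholders and are not from the linked page; substitute your own values.

```python
def build_wasb_uri(container, account, path="", secure=True):
    """Build a wasb[s]:// URI for a blob in an Azure Storage container.

    container, account and path are placeholders supplied by the caller.
    """
    scheme = "wasbs" if secure else "wasb"
    # Strip a leading slash so the URI does not contain a double slash.
    return f"{scheme}://{container}@{account}.blob.core.windows.net/{path.lstrip('/')}"


def read_text_from_blob(spark, uri):
    """Read a text file from blob storage using an existing SparkSession.

    Assumes this runs on the cluster (for example in a PySpark notebook),
    where `spark` is already defined and the storage account is attached.
    """
    return spark.read.text(uri)


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    print(build_wasb_uri("mycontainer", "myaccount", "example/data/sample.log"))
```

On the cluster you would then call something like `read_text_from_blob(spark, build_wasb_uri(...))`. For the storage account chosen as the cluster default, the short form `wasb:///path/to/file` should also work, including from the command line with `hdfs dfs -ls wasb:///`.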

To learn more, I recommend browsing this GitHub repository:

https://github.com/Blackmist/hdinsight-tools
answered Apr 15, 2018 by Shubham
• 13,290 points
