How do I connect my Spark-based HDInsight cluster to my blob storage?

0 votes
I created a blob storage account and an HDInsight cluster earlier. Now I need to connect to and access the blob storage from the HDInsight cluster. I haven't done this before, and I can't find a tutorial that explains how to do it.

I have just created a Spark-based HDInsight cluster. While creating the cluster, I selected a blob storage account that I had created before. However, I have no idea how to access that blob storage from within the VM created there. I have read many different tutorials, but couldn't find a proper answer.

Can I add and access blob storage just like HDFS?
Apr 15, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,837 views

1 answer to this question.

0 votes
Go through this page in the official documentation:

https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-hadoop-use-blob-storage#access-blobs

I went through this official HDInsight documentation page, which explains how to access blobs from a cluster. It also provides PowerShell commands for accessing data stored in blobs.
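As a rough illustration of what "just like HDFS" can look like from Spark: blob paths on HDInsight are normally addressed with a `wasb://` (or `wasbs://` for TLS) URI of the form `wasbs://<container>@<account>.blob.core.windows.net/<path>`. The sketch below only builds such a URI and passes it to Spark's CSV reader. The helper names, the container name, the account name, and the file path are all made up for the example, and it assumes the storage account is the one you selected when creating the cluster, so its credentials are already in the cluster configuration.

```python
def wasb_uri(container, account, path="", secure=True):
    """Build a WASB URI for a blob path.

    container and account are placeholders for your own
    container name and storage account name.
    """
    scheme = "wasbs" if secure else "wasb"
    # Drop any leading slash so the URI has exactly one after the host.
    return f"{scheme}://{container}@{account}.blob.core.windows.net/{path.lstrip('/')}"


def read_csv_from_blob(spark, container, account, path):
    """Read a CSV blob into a DataFrame using an existing SparkSession.

    Hypothetical helper: assumes the cluster already holds the
    credentials for the storage account.
    """
    return spark.read.csv(wasb_uri(container, account, path), header=True)


# Example with made-up names; `spark` is the session that the
# pyspark shell or a notebook on the cluster already provides:
# df = read_csv_from_blob(spark, "mycontainer", "mystorageaccount",
#                         "example/data/sample.csv")
# df.show(5)
```

The same URI should also work with the regular HDFS shell on the head node, for example `hdfs dfs -ls wasbs://mycontainer@mystorageaccount.blob.core.windows.net/`. For the cluster's default container, the short form `wasb:///` (or a plain `/path`) is usually enough.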

To learn more, I would recommend browsing this GitHub repository:

https://github.com/Blackmist/hdinsight-tools
answered Apr 15, 2018 by Shubham
• 13,490 points
