TNWiki Article Spotlight – Using an HDInsight Cluster with Alternate Storage Accounts and Metastores

Hello and welcome everybody to our TNWiki Article Spotlight on Tuesday.

As the one who tries to represent the Azure enthusiasts on the Wiki Ninjas authoring team, I have another piece of sugar for you.

And we have come a long way!

After Hadoop-based Services for Windows, Introduction to HDInsight Services for Windows Azure, and Microsoft HDInsight (Big Data) Solution, I'm happy to announce the next article in the HDInsight family - Using an HDInsight Cluster with Alternate Storage Accounts and Metastores.

Eric N. Hanson, the author of this article, gives you a short overview of HDInsight's default behavior regarding its storage accounts.

After this short introduction, he provides PowerShell scripts that show how to submit a Hive job and specify a per-job metastore database and a per-job storage account.

The same approach is shown for MapReduce, MapReduce Streaming, and Pig jobs. He ends with a short paragraph on best practices for the techniques you have just seen.

If you are working with HDInsight, this is a must-read for you!

- German Ninja Jan (Twitter, Blog, Profile)
