Hello and welcome everybody to our TNWiki Article Spotlight on Tuesday.
Known as someone who tries to represent the Azure enthusiasts in the Wiki Ninjas authoring team, I have another piece of sugar for you.
But first, look how far we have come!
After Hadoop-based Services for Windows, Introduction to HDInsight Services for Windows Azure, and Microsoft HDInsight (Big Data) Solution, I'm happy to announce the next article in the HDInsight family: Using an HDInsight Cluster with Alternate Storage Accounts and Metastores.
Eric N. Hanson, the author of this article, gives you a short overview of HDInsight's default behavior regarding its storage accounts.
After this short introduction, he provides PowerShell scripts that show how to submit a Hive job and specify a per-job metastore database and a per-job storage account.
The same approach is shown for MapReduce, MapReduce Streaming, and Pig. He ends with a short paragraph on best practices for the techniques you have just seen.
If you are working with HDInsight, this is a must-read for you!