Recent Updates to the Microsoft Data Science Virtual Machine

Posted by Gopi Kumar, Principal Program Manager in the Microsoft Data Group. It’s been over 9 months since we first released the Data Science Virtual Machine (DSVM), a custom virtual machine image we published in the Azure Marketplace with a host of popular data science tools pre-installed and pre-configured. We’ve made a few updates since…

What Is Your Data Science Super Power?

This post is authored by Wee Hyong Tok, Senior Data Scientist Manager, and Danielle Dean, Senior Data Scientist Lead, at Microsoft. Wee and Danielle are speakers at the upcoming Microsoft Data Science Summit on September 26-27 in Atlanta, GA. How do businesses and data scientists turn raw data into intelligent action? Why do some companies…

ICYMI: How to Do Data Science

In case you missed it: Earlier this year, Brandon Rohrer, Senior Data Scientist in the Data Group at Microsoft, published a popular blog post titled How to Do Data Science. Brandon is one of our speakers at the upcoming Microsoft Data Science Summit, a two-day intensive event for data scientists, big data engineers, ML practitioners…

Register Now for the First Microsoft Data Science Summit

Posted by Joseph Sirosh, Corporate Vice President of the Data Group at Microsoft. I am super excited about the very first Microsoft Data Science Summit, to be held in Atlanta on September 26-27, 2016, and invite all data scientists, big data engineers and machine learning practitioners among you to attend. The Summit – which features…

End-to-End Data Science Using Spark on Azure HDInsight

As part of the Microsoft Data Science Process, we’ve created a comprehensive walkthrough using pySpark and MLlib to demonstrate how to conduct end-to-end data science on Azure HDInsight Spark clusters. With detailed examples and pySpark code that are publicly accessible from a GitHub repository, we highlight how to: Easily provision a managed Azure…

Microsoft Makes Big Data and Analytics Easier in the Cloud

This post is by Joseph Sirosh, Corporate Vice President of the Data Group at Microsoft. This week I’m joining thousands of people attending Strata + Hadoop World in San Jose to explore the technology and business of big data and data science. As part of our participation in the conference, we are announcing several important…

Free Webinar: Building A Scalable Data Science Platform with R and Hadoop

Hadoop is famously scalable. Cloud computing is famously scalable. But R – the preferred software and lingua franca of data scientists worldwide – not so much. But what if we seamlessly combined Hadoop with the cloud and R to create a scalable data science platform? Imagine exploring, transforming, modeling, and scoring data at any scale…

Free Webinar: Best Practices for using Microsoft R Server with Hadoop

R is the world’s most widely used programming language for data analysis, and Hadoop is a fast-growing infrastructure for storing and manipulating extremely large datasets. In this free webinar we will discuss how Microsoft R Server can be used with Hadoop, and several best practices for using these technologies together, including installation, software…

Free Webinar: What is Azure Data Lake?

Big Data and Data Warehousing are at the core of any data platform discussion these days. Tune in to this free webinar to learn about building end-to-end Big Data solutions using Azure Data Lake, a new offering that’s a critical element of the Microsoft data platform story and part of the Cortana Analytics Suite. Join the…

REEF Graduates to a Top-Level Apache Project

This post is authored by Markus Weimer, Principal Scientist, Beysim Sezgin, Principal Engineer, and Hiren Patel, Program Manager, all at Microsoft. Raghu Ramakrishnan, Microsoft’s Chief Technology Officer for Data, recently shared some behind-the-scenes details of Azure Data Lake. We are excited to share some important news regarding one of those systems today: In November this…