Using Microsoft’s Deep Learning Toolkit with Spark on Azure HDInsight Clusters

This post is authored by Miruna Oprescu, Software Engineer, and Mary Wahl, Data Scientist at Microsoft Have you ever wondered what it would be like to combine the power of deep learning with the scalability of distributed computing? Say no more! We present a solution that uses leading-edge technologies to score images using a pre-trained… Read more

Announcing R Tools 1.0 for Visual Studio 2015

This post is authored by Shahrokh Mortazavi, Partner Director of Program Management at Microsoft. I’m delighted to announce the General Availability of R Tools 1.0 for Visual Studio 2015 (RTVS). This release will be shortly followed by R Tools 1.0 for Visual Studio 2017 in early May. RTVS is a free and open source plug-in… Read more

End-to-End Data Science Walkthrough with Spark 2.0 on Azure HDInsight Hadoop Clusters

This post is authored by Debraj GuhaThakurta, Senior Data Scientist, and Brad Severtson, Senior Content Developer, at Microsoft. The data scientists among you would have seen how Spark 2.0, which released in July 2016, offered several enhancements over Spark 1.6. These enhancements included: Easier ANSI SQL and more streamlined APIs. Improvements in the speeds of… Read more

Julia – A Fresh Approach to Numerical Computing

This post is authored by Viral B. Shah, co-creator of the Julia language and co-founder and CEO at Julia Computing, and Avik Sengupta, head of engineering at Julia Computing. The Julia language provides a fresh new approach to numerical computing, where there is no longer a compromise between performance and productivity. A high-level language that… Read more

Announcing Data Science Utilities Version 0.11, for the Team Data Science Process

This post is authored by Hang Zhang, Senior Data Scientist Manager, Gopi Kumar, Principal Program Manager, and Xibin Gao, Data Scientist, at Microsoft. Back in September 2016, we released an early public preview of Team Data Science Process (TDSP), with the goal of supporting secure collaboration within enterprise data science organizations, with capabilities such as… Read more

Jupyter Notebooks with R in Azure ML Studio

This post is authored by Andrie de Vries, Senior Program Manager at Microsoft Introduction Azure ML Studio is a powerful canvas for the composition of machine learning experiments and their subsequent operationalization and consumption. In addition to experiments, Azure ML Studio also contains Jupyter notebooks, but until now the notebook kernels have been restricted to Python… Read more

Ten Things You Can Do on the Microsoft Data Science Virtual Machine

Posted by Gopi Kumar, Senior Program Manager in the Microsoft Data Group. In November last year, we announced the availability of the Microsoft Data Science Virtual Machine (DSVM), an operating system image we published in the Azure Marketplace with a host of popular data science tools pre-installed and pre-configured. In January this year, we updated… Read more

Python Notebooks in the Cortana Analytics Gallery

Posted by the Cortana Analytics Gallery Team at Microsoft We are super excited to announce the addition of Python notebooks to the Cortana Analytics Gallery. Overview Jupyter notebooks (formerly IPython) provide a highly productive canvas for data scientists and developers to explore ideas. At its heart, Jupyter is a multi-lingual REPL (Read, Eval, print and… Read more

Announcing the Availability of the Microsoft Data Science Virtual Machine

This blog post is by Gopi Kumar, Senior Program Manager in the Microsoft Data Group. We are very happy to announce the immediate availability of the Microsoft Data Science Virtual Machine – a Windows Server 2012 -based custom virtual machine image on the Azure marketplace containing several popular tools that can be used by data… Read more

Free Webinar: Using the Cortana Analytics Process to Build Intelligent Apps

Join us at this demo-rich session next Tuesday, November 10th, to learn about the Cortana Analytics Process, a systematic approach to data science that helps you understand raw data and transform it into intelligent insights and actions. We will walk you through a real-world dataset, show you how to use JuPyter notebooks to explore the… Read more