End-to-End Data Science Walkthrough with Spark 2.0 on Azure HDInsight Hadoop Clusters

This post is authored by Debraj GuhaThakurta, Senior Data Scientist, and Brad Severtson, Senior Content Developer, at Microsoft. The data scientists among you will have seen how Spark 2.0, which was released in July 2016, offered several enhancements over Spark 1.6. These enhancements included: Easier ANSI SQL and more streamlined APIs. Improvements in the speeds of…

Build & Deploy Machine Learning Apps on Big Data Platforms with Microsoft Linux Data Science Virtual Machine

This post is authored by Gopi Kumar, Principal Program Manager in the Data Group at Microsoft. This post covers our latest additions to the Microsoft Linux Data Science Virtual Machine (DSVM), a custom VM image on Azure, purpose-built for data science, deep learning and analytics. Offered in both Microsoft Windows and Linux editions, DSVM includes…

Moving eBird to the Azure Cloud

Re-posted from the Azure Data Lake & HDInsight blog. Hosted by the Cornell Lab of Ornithology, eBird is a citizen science project that allows birders to submit observations to a central database. Birders seek to identify and record the birds that they discover, and can also report how much effort it took to find those…

Introducing Microsoft R Server 9.0

This post is authored by Nagesh Pabbisetty, Partner Director of Program Management at Microsoft. To thrive in today’s data-driven world, businesses increasingly need more powerful analytics solutions to predict customer behavior and discover new opportunities. However, existing solutions often fail to deliver enough insights, fast enough. At Microsoft, we continue to invest deeply in advanced…

Free Online Workshop on Cortana Intelligence Suite: Register Now!

Get Live, Step-by-Step Guidance from Microsoft Experts. This post is authored by Matthew Calder, Senior Content Developer at Microsoft. Join us on Microsoft Virtual Academy on Tuesday, December 6th, 2016, from 9 AM to 4 PM Pacific, for an exciting look at the Cortana Intelligence Suite (CIS), and end your day with a fully working intelligent web…

Data Manipulation at Scale with Microsoft R Server & Spark on Azure HDInsight

Re-posted from the Revolutions blog. Dealing with distributed data and having to program concurrent systems is not always the easiest of tasks, and data scientists familiar with R are unlikely to have extensive experience with such systems. In such scenarios, Spark offers a very popular, intuitive distributed data processing platform, with R and Python APIs…

Applying Deep Learning at Cloud Scale, with Microsoft R Server & Azure Data Lake

This post is by Max Kaznady, Data Scientist, Miguel Fierro, Data Scientist, Richin Jain, Solution Architect, T. J. Hazen, Principal Data Scientist Manager, and Tao Wu, Principal Data Scientist Manager, all at Microsoft. Today’s businesses collect vast volumes of images, video, text and other types of data – data which can provide tremendous business value…

Riding the Big Data Tiger

This post is authored by Omid Afnan, Principal Group Program Manager at Microsoft. Omid’s talk, “Go Big (with Data Lake Architecture) or Go Home!” will be featured at the Microsoft Machine Learning & Data Science Summit on September 26-27 in Atlanta. I’ve had the pleasure (and pains) of working in big data for years, first…

5 Cloud AI Innovations at the Microsoft Machine Learning & Data Science Summit

This post is by Joseph Sirosh, Corporate Vice President of the Data Group at Microsoft. I’m excited to invite you to our first Microsoft Machine Learning & Data Science Summit, which kicks off next Monday, September 26th, in Atlanta. The Summit is a unique event for machine learning developers, data scientists and big data engineers,…

What Is Your Data Science Super Power?

This post is authored by Wee Hyong Tok, Senior Data Scientist Manager, and Danielle Dean, Senior Data Scientist Lead, at Microsoft. Wee and Danielle are speakers at the upcoming Microsoft Data Science Summit on September 26-27 in Atlanta, GA. How do businesses and data scientists turn raw data into intelligent action? Why do some companies…