Using Azure ML to Build Clickthrough Prediction Models

This blog post is by Girish Nathan, a Senior Data Scientist at Microsoft. Ad click prediction is a multibillion-dollar industry, and one that is still growing rapidly. In this post, we build ML models on the largest publicly available ad click prediction dataset, from Criteo. The Criteo dataset consists of some 4.4 billion advertising… Read more

Building Azure ML Models on the NYC Taxi Dataset

This blog post is by Girish Nathan, a Senior Data Scientist at Microsoft. The public NYC taxi dataset consists of over 173 million NYC taxi rides from 2013. The dataset includes driver details, pickup and drop-off locations, time of day, trip locations (longitude-latitude), cab fare and tip amounts. An analysis of the data… Read more

Now Available on Azure ML – Criteo's 1TB Click Prediction Dataset

This post is by Misha Bilenko, Principal Researcher in Microsoft Azure Machine Learning. Measurement is the bedrock of all science and engineering. Progress in the field of machine learning has traditionally been measured against well-known benchmarks such as the many datasets available in the UCI-ML repository, in the KDDCup and Kaggle contests and on ImageNet… Read more

Free Webinar Tomorrow: Building Predictive Models with Large Datasets

Predictive analytics problems often involve large datasets that aren’t manageable on a single local client or even a server machine. This webinar will use the public NYC taxi ride dataset to discuss how to store, manipulate and analyze such large datasets using Azure storage, HDInsight (Hadoop) and Azure ML. We will use the new… Read more

Announcing the General Availability of Azure Machine Learning

This blog post is authored by Joseph Sirosh, Corporate Vice President of Information Management & Machine Learning at Microsoft. We built Azure Machine Learning to democratize machine learning. We wanted to eliminate the heavy lifting involved in building and deploying machine learning technology and make it accessible to everybody. Supporting open source innovation and enabling… Read more

Big Learning Made Easy – with Counts!

This post is by Misha Bilenko, Principal Researcher in Microsoft Azure Machine Learning. This week, Azure ML is launching an exciting new capability for training on terabytes of data. It is based on a surprisingly simple yet amazingly robust learning algorithm that is widely used by practitioners, yet receives virtually no dedicated attention in the ML literature… Read more