Windows HPC WCF/SOA tracing

I was recently looking into a failure in a WCF client application that runs fine with the debugger attached but fails when run standalone. The error message was as follows: Microsoft.Hpc.ServiceBroker Warning: 0 : Service net.tcp://private.TestHeadNode:9088/346/1728/_defaultEndpoint failed. Error:The server did not provide a meaningful reply; this might be caused by a contract mismatch, a…


Troubleshooting Windows HPC WCF/SOA Issues

HPC uses HPC sessions to support the service-oriented architecture (SOA) programming model based on Windows Communication Foundation (WCF). Troubleshooting errors from these SOA-based applications can sometimes be challenging. However, the tip I’m about to share should help you figure out exactly where the issue is coming from. Looking through the trace of the…

Helpful tip for dealing with HPC Pack setup failures

I once ran into an issue while installing HPC Pack on a machine, and I wondered where to find more information about the setup failure, preferably a log of some sort that would show the reason for it. Here is a tip that helped. The HPC Pack setup writes two logs…


Authentication Failure error on Windows HPC caused by port conflict

I recently ran into an issue where every attempt to run a cluster command from the command line resulted in an authentication failure. We were able to connect to the cluster from the GUI and PowerShell; however, all attempts to connect to the cluster from the command line failed with that simple error – Authentication…

DNS Suffix vs. Active Directory Domain for HPC cluster

A few months ago, I was asked how to assign a connection-specific DNS suffix to a NIC that differed from the Active Directory domain. For example, the AD domain was contoso.com, but they wanted to be able to assign a DNS suffix of cluster.contoso.com. If the Fully Qualified Domain Name (FQDN) of…

How to tell if a .exe file is a 32-bit or 64-bit application using dumpbin

Goodfolk, one of my customers wanted to know if they had really built a 64-bit application for their cluster. After all, we run Windows HPC Server on a 64-bit server OS, so why not take full advantage of it? I had a few minutes before I had to get back to them, so I thought…

HPC Visual Studio Command Line Compilation from PowerShell

I admit to an almost perverse fondness for command line environments. So I decided to jump onto PowerShell as my default command line environment, because it’s way more cool than the cmd shell, and even a nerd with a penchant for undead languages sometimes worries about being cool to some tiny degree. Only know about bash, csh,…


Condor Week and HPCS Microsoft Opens the Window to the Giant Bird

Every year in April, the users and creators of the Condor open-source scheduling solution meet in Madison, WI, for Condor Week. In the past my employer, Microsoft, has strongly encouraged the implementation of Condor on Windows. When we introduced our own job scheduler, we fell off the speaker schedule at the conference. …


Some Hard Data on the Effectiveness of Hedging

In my last post I posed the question “Does Hedging really Work”. Since then I’ve been doing some research for an article in Windows in Financial Services and received the following quote from Ken Mungan, FSA, MAAA, and Principal at the Milliman Financial Risk Management practice: Research at Milliman has shown that life…


A clusrun bug

We’ve had reports of a bug with clusrun, specifically that environment variables set using the /env flag to clusrun may or may not get propagated to compute nodes when the clusrun job is run. For example, doing "clusrun /env:test=ItWorked set" would result in some nodes reporting the test variable as set and others reporting it…