We have an Azure cloud service that has auto scaling enabled. We think that some of our messages occasionally are getting processed by a machine that is subsequently switched off by the auto scale down process. This means that the dequeue count on the message is increased and if this happens a couple of time we mark the message as failed when it is dequeued again (we let messages try to be processed twice)
Is the auto scaling choice of which machine to shut down purely based on the CPU usage of the worker role host process?
Is there a way to signal to the Azure infrastructure that a certain machine should not be turned off?