Unable to get certain node types allocated

Erik Heeren 20 Reputation points
2025-08-18T14:27:11.67+00:00

Hi,

We're experimenting with different node types on Batch:

  1. Standard_D96as_v5
  2. Standard_D96as_v6
  3. Standard_D96als_v6
  4. Standard_D96ls_v6
  5. Standard_D96s_v6

I can't allocate any of these nodes in east-us or east-us2, instead I get a rather vague

Message: {'additional_properties': {}, 'lang': 'en-US', 'value': 'Server encountered an internal error. Please try again after some time.\nRequestId:6f275e2d-9b6f-4f99-8d41-90d120762937\nTime:2025-07-21T11:29:38.4397555Z'}

I'm sure I don't need to specify that trying again later doesn't help.

What's actually going wrong here?

Azure Batch
Azure Batch
An Azure service that provides cloud-scale job scheduling and compute management.
{count} votes

Accepted answer
  1. Vinodh247 37,216 Reputation points MVP Volunteer Moderator
    2025-08-18T16:18:08.08+00:00

    Hi ,

    Thanks for reaching out to Microsoft Q&A.

    Nothing is wrong with your Batch pool setup. The SKUs you are requesting are simply not in capacity in East US / East US 2 today. That “internal error” is the symptom of azure having no supply of those VM types in those regions. The real fix is to either (a) move to a region where the SKUs exist, or (b) request capacity confirmation from Microsoft.

    Behind the scenes:

    1. The node families you listed are the newer Ampere/Zen4 SKUs (D96as_v5, D96as_v6, D96als_v6, D96ls_v6, D96s_v6). These are very large 96-core machines. They are only deployed in limited clusters within each region.
    2. Batch pools do not “magically” pull a VM SKU from anywhere: they are constrained to the underlying scale unit capacity in that region + zone. If the SKU is not stocked in East US / East US 2, or if the cluster has no free quota, the allocation call fails.
    3. The message you see (Server encountered an internal error) is unfortunately a generic “capacity error” that comes back when Batch asks the Fabric Controller for VMs and it responds “none available”.
      • When it is truly a quota issue (you have not been approved for that SKU in that region), you normally see a clearer error (AllocationFailed, CoreQuotaExceeded).
      • When it is zero capacity in the cluster (the SKU is either not deployed yet or fully oversubscribed), it surfaces as this vague “internal error”.
    4. That is why “trying again later” does not help: there is simply no inventory of those SKUs in east-us/east-us2 right now.

    How to verify and work around:

    Run az vm list-skus --location eastus --size Standard_D96 --all to check if those SKUs are actually offered in that region. You will likely see them marked as NotAvailableForSubscription or Unavailable.

    Try other regions where Microsoft has stocked the v6 SKUs (for example: westus3, southcentralus, or canadacentral tend to get the new silicon first).

    • If you must use east-us, open a support request with azure to check if and when those SKUs will be made available there. Azure sometimes staggers rollout of v6 SKUs region by region.
    • If your workload is flexible, consider smaller sizes in the same family (ex: D48as_v6, D32as_v6) these are often more readily available.

    Please 'Upvote'(Thumbs-up) and 'Accept' as answer if the reply was helpful. This will be benefitting other community members who face the same issue.

    1 person found this answer helpful.

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.