AI/ML

Microsoft Launches 7 New MAI AI Models: Reasoning, Coding, Images, Voice, and More

image
  • image
    Vimal Tarsariya
    Author
    • Linkedin Logo
    • icon
  • icon
    Jun 5, 2026

Microsoft just launched seven new AI models at Build 2026. These Microsoft MAI AI Models cover reasoning, coding, image generation, voice, and transcription, and they mark the company's biggest move yet to build its own AI instead of leaning on partners.

For years, Microsoft mostly ran its products on models from other labs. That has changed. The new family was built in-house by the Microsoft AI Superintelligence Team, and every model was trained from scratch. If you plan AI projects for your business, this shift matters. It gives you more model choices, lower costs, and tighter control over your data. Our team at Vasundhara Infotech builds with these kinds of tools every day, and you can see how we approach it on our AI services page.

This post breaks down all seven models in plain terms. We cover what each one does, who it is for, and how it fits into Azure AI Foundry.

What did Microsoft announce at Build 2026?

Microsoft Build 2026 is the company's yearly developer event. It ran on June 2 and 3, 2026, in San Francisco. The headline news was a family of seven new Microsoft AI Models, branded as MAI.

The bigger story is the strategy behind them. Microsoft has put billions of dollars into other AI labs. It invested 13 billion dollars in OpenAI and 5 billion dollars in Anthropic. Now it wants its own stack too. These Microsoft AI announcements, shared on the Microsoft AI blog, show a company that wants to own the models under its products, not just rent them. It is also a clear bet on Microsoft generative AI as a core part of its future.

There is a clear reason for this. Microsoft says all MAI models were trained on clean and licensed data, with no distillation from other labs. That answers a real worry for many buyers. It also co-designed the models with its own Maia 200 chips, which gave a 1.4 times efficiency boost.

What are the 7 Microsoft MAI AI Models?

The MAI family spans five areas: reasoning, coding, image, voice, and transcription. Some areas have a standard model and a faster Flash version. Here is each one in plain terms.

 

MAI-Thinking-1: the reasoning model

MAI-Thinking-1 is the flagship of the group. It is one of the new AI reasoning models, built for hard, multi-step problems. It is a mid-sized model with 35 billion active parameters and a 256K context window. That means it can hold a lot of text at once.

The scores are strong for its size. It hit 97 percent on the AIME 2025 math test. On SWE-Bench Pro, a tough coding benchmark, it scored 53 percent, which puts it next to Opus 4.6. In blind tests, human raters preferred it over Sonnet 4.6. It is best for CTOs and teams that need deep reasoning and code generation. It is open now on Foundry in private preview.

MAI-Code-1 Flash: the coding model

MAI-Code-1 Flash is built for developers. It is one of the new AI coding models, tuned to plan and write code from start to finish. It is small and fast, with 5 billion active parameters. Microsoft says it is on par with Haiku but cheaper to run.

The best part is where it lives. It is built into GitHub Copilot and VS Code, so your team can use it inside tools they already know. It suits engineering teams that want quick, low-cost help with daily coding tasks.

MAI-Image 2.5: the image model

MAI-Image 2.5 is one of the new AI image generation models. It handles both text-to-image and image editing. On the Arena leaderboard, it ranked No. 2 for image editing and passed the score of Nano Banana Pro.

There is also a Flash variant for high-volume work at lower cost. The full model is already live in PowerPoint and is rolling out to OneDrive. It is a good fit for marketing, design, and content teams.

MAI-Transcribe-1.5: the transcription model

MAI-Transcribe-1.5 turns audio into text. Microsoft calls it the best transcription model in the world, with top accuracy. It runs five times faster than rival models. It supports 43 languages and can handle domain-specific terms, which helps in fields like health and law.

MAI-Voice-2: the voice model

MAI-Voice-2 creates natural speech in 15 languages. It can copy a voice from a short audio sample, with safeguards to prevent misuse. A Flash version is coming soon for low-cost, fast voice agents. Voice agents are a big trend in 2026, so this model targets customer support and call center use.

Where can you run these models?

You are not locked into one place. The MAI models are on Azure AI Foundry, and they are also on OpenRouter, Fireworks AI, and Baseten. For the first time, developers can tune the model weights themselves.

How do the MAI models fit into Azure AI Foundry?

Azure AI Foundry is the home base for these models in the enterprise. Foundry gives you one platform with enterprise governance and Azure data residency. That means your data can stay in the region you choose, which is key for regulated work.

 

Microsoft also added a feature called Frontier Tuning. It lets you train a MAI model on your own work data, inside your own environment. The model learns your tasks, and the result stays yours. Microsoft says its tuned Excel model matched GPT 5.4 while being up to 10 times more efficient.

There is a health angle too. Microsoft and Mayo Clinic are building a frontier AI model for healthcare. It will use Mayo Clinic's clinical data and Microsoft's AI work. The model will be owned by Mayo Clinic and later offered to others through Foundry.

If you are weighing how these models fit your roadmap, our AI development company team can help you map the options. For strategy work, our innovation consulting service helps leaders align AI plans with real business goals.

What do the new Microsoft AI Models mean for businesses?

The launch lands in a fast-growing market. The global generative AI market is set to reach 109.37 billion dollars by 2030, growing at a 37.6 percent yearly rate. The enterprise slice is growing even faster. Enterprise generative AI is forecast to hit 19.8 billion dollars by 2030, at a 38.4 percent yearly rate.

Adoption is already wide. McKinsey reports that more than three-quarters of organizations now use AI in at least one business function. The gap is no longer about trying AI. It is about scaling it well.

For enterprise AI solutions, the MAI launch brings three real gains. First, more choice, since you can pick a model that fits the task and budget. Second, lower cost, since these are mid-sized and efficient models. Third, more control, thanks to clean training data and custom tuning. For startup founders, the low-cost coding and image models lower the barrier to building AI products. For operations and CX leaders, the voice and transcription models open new ways to handle support at scale.

What about AI compliance?

Strong models are only half the job. You also need to deploy them in a safe and legal way. This matters most in the US, UK, and UAE markets, where rules are tightening.

Start with data privacy. If you serve users in Europe, the GDPR sets strict rules on how you collect and store personal data. Azure data residency in Foundry can help you keep data in the right region. Next, look at the EU AI Act. It sorts AI systems by risk level and sets duties for higher-risk uses. If your product reaches EU users, you may need to meet these duties.

There are also industry rules. Healthcare work in the US must follow HIPAA. Voice cloning needs care too, since copying a real voice can raise consent and fraud issues. MAI-Voice-2 ships with safeguards, but you still own how you use it. A good rule is simple. Keep a human in the loop, log what the model does, and check your use against local law before you launch.

Frequently asked questions

The Microsoft MAI AI Models are a family of seven AI models built in-house by Microsoft. They were announced at Build 2026 and cover reasoning, coding, image generation, voice, and transcription. The lineup includes MAI-Thinking-1, MAI-Code-1 Flash, MAI-Image 2.5 and its Flash variant, MAI-Transcribe-1.5, MAI-Voice-2, and MAI-Voice-2-Flash.
Microsoft revealed the models at Build 2026, its yearly developer event, on June 2, 2026, in San Francisco.
MAI-Thinking-1 is Microsoft's first large reasoning model. It is built for complex, multi-step problems, long-context reasoning, and code generation. It has 35 billion active parameters and a 256K context window.
MAI-Code-1 Flash is built into GitHub Copilot and VS Code, so access depends on your Copilot plan. Microsoft positions it as a low-cost coding model, on par with Haiku but cheaper to run.
You can run the MAI models on Azure AI Foundry, which adds enterprise governance and data residency. They are also available on OpenRouter, Fireworks AI, and Baseten for developers who want other paths.
They can be, if you deploy them with care. Azure AI Foundry supports data residency, which helps with GDPR and similar rules. You still need to match your use case to laws like the EU AI Act and HIPAA, and keep human oversight in place.