AI ScaffoldingExplainerJun 12, 2026, 12:29 AM· 3 min read· #2 of 2 in technology

Microsoft and Xiaomi Open-Source Breakthroughs in AI Coding Memory and Skills

Two major open-source releases—Microsoft's SkillOpt and Xiaomi's MiMo Code—are solving the biggest bottlenecks in AI software development by giving models persistent memory and self-optimizing instructions.

By Factlen Editorial Team

Share this story

Open-Source Advocates 40%AI Engineering Pragmatists 40%Enterprise Developers 20%

Open-Source Advocates: Argue that open scaffolding tools prevent vendor lock-in and democratize AI development.
AI Engineering Pragmatists: Emphasize that optimizing text instructions and external memory is vastly cheaper and more predictable than fine-tuning model weights.
Enterprise Developers: Focus on the practical benefits of deploying reliable, long-horizon AI agents in production environments.

What's not represented

· Proprietary AI vendors whose market share may be threatened by open-source scaffolding

Why this matters

By shifting the focus from training massive, expensive AI models to optimizing the open-source 'scaffolding' around them, these breakthroughs allow everyday developers to build highly capable, long-memory AI coding assistants for free—dramatically lowering the cost of enterprise software development.

Key points

Microsoft has open-sourced SkillOpt, a framework that treats text-based agent instructions like trainable neural networks.
SkillOpt boosted the accuracy of models like GPT-5.5 by 23.5 percentage points without requiring any changes to the model's underlying weights.
Xiaomi released MiMo Code V0.1.0, an open-source AI coding assistant designed to solve the 'amnesia' problem in long-horizon programming tasks.
MiMo Code uses a dedicated background subagent and a persistent memory database to summarize and store context, allowing it to handle tasks exceeding 200 steps.
In benchmark testing, MiMo Code outperformed Anthropic's Claude Code on complex software engineering evaluations.
Both tools signal a broader industry shift toward optimizing AI behavioral scaffolding rather than relying solely on larger context windows.

+23.5 pts

GPT-5.5 accuracy lift via SkillOpt

62%

MiMo Code score on SWE-Bench Pro

200+

Steps in long-horizon tasks handled by MiMo

The promise of AI coding assistants has hit a wall: long-horizon tasks. Once a project stretches past 200 steps, models suffer from "amnesia," forgetting earlier decisions as their context windows fill up. Meanwhile, the instructions guiding these agents—known as skills—remain brittle, requiring human developers to play a guessing game of prompt engineering.[1][2][3][5]

But a fundamental shift is underway in how the software industry approaches artificial intelligence. Instead of waiting for the next trillion-parameter model to solve these issues, researchers are building smarter "scaffolding" around existing, frozen models.[3][7]

In a span of just two weeks in mid-2026, two major open-source releases have fundamentally altered the AI engineering landscape. Microsoft introduced SkillOpt, a framework that trains text-based agent instructions like neural networks, while Xiaomi launched MiMo Code V0.1.0, a terminal-native coding harness that gives AI persistent, cross-session memory.[1][2]

Microsoft’s SkillOpt, released under an MIT license, tackles the brittleness of agent instructions. In modern AI applications, agents rely on a folder of markdown files that dictate their behavior, tool use, and constraints.[1][4]

SkillOpt treats plain-text markdown files as trainable parameters, optimizing agent instructions without altering model weights.

Traditionally, optimizing these markdown files meant a human developer manually tweaking text and hoping for better results. SkillOpt automates this by treating the plain text document as a trainable parameter, shifting the focus from training model weights to training behaviors.[1][3][4]

Using deep-learning concepts adapted for text—such as epochs, batch sizes, and validation gates—a separate optimizer model runs the target agent on a batch of tasks, reflects on its failures, and proposes bounded edits to the markdown file. An edit is only kept if it strictly improves the agent's score on a held-out validation set.[3][4][7]

An edit is only kept if it strictly improves the agent's score on a held-out validation set.

The results are striking. On OpenAI's GPT-5.5, SkillOpt lifted average accuracy by 23.5 percentage points across industry benchmarks, with zero added inference cost at deployment. Because the final artifact is just an optimized text file, it can be easily shared and transferred across different models and execution environments.[4][7]

While Microsoft solved the instruction problem, Xiaomi’s MiMo Code targets the memory bottleneck. Released as an open-source fork of the OpenCode project, MiMo Code is a terminal-based AI assistant designed specifically for ultra-long, multi-step coding tasks.[2][6]

To cure context-window amnesia, Xiaomi engineered a persistent memory system. Instead of relying solely on the active context window, MiMo Code deploys a dedicated background subagent that continuously monitors the session.[2][8]

MiMo Code uses a dedicated background subagent to compress and store context, preventing AI amnesia during long tasks.

As the conversation approaches its token limit, this subagent automatically condenses the interactions into a structured summary, storing it in a persistent memory file. This allows the main coding agent to continue working uninterrupted without losing its grasp on the project's architecture.[2][5][8]

Xiaomi also introduced a novel maintenance feature called "/dream". Running automatically every seven days, this background process reviews old sessions, removes duplicate memories, verifies file paths, and compresses the stored information into an updated long-term repository.[2][5]

This explicit storage-and-retrieval mechanism has yielded impressive benchmark victories. According to Xiaomi, MiMo Code achieved a 62 percent score on SWE-Bench Pro and 73 percent on Terminal Bench 2, outperforming Anthropic’s Claude Code by roughly five percentage points while using the same underlying base model.[2][6]

Xiaomi reports that MiMo Code's persistent memory architecture allows it to outperform Claude Code on major industry benchmarks.

Both releases underscore a growing consensus in the AI community: the highest-leverage optimization target is no longer the model weights, but the behavioral scaffolding surrounding them. By open-sourcing these tools, Microsoft and Xiaomi are democratizing capabilities that were previously locked behind proprietary enterprise platforms.[1][2][3][7]

Developers can now equip open-weight models like Qwen or DeepSeek with the memory and precision of frontier models, entirely avoiding vendor lock-in. As the focus shifts from training massive neural networks to optimizing agent behaviors, the barrier to building industrial-grade AI software continues to fall rapidly.[2][4][3][8]

How we got here

2025
Xiaomi announces MiMo-7B, its first open-source reasoning and coding model.
May 2026
Microsoft Research releases SkillOpt, introducing text-space optimization for AI agent skills.
June 10, 2026
Xiaomi open-sources MiMo Code V0.1.0, introducing persistent memory for long-horizon coding tasks.

Viewpoints in depth

Open-Source Advocates

Argue that tools like SkillOpt and MiMo Code prevent vendor lock-in and democratize AI development.

This camp views the release of advanced scaffolding tools as a major victory against the centralization of AI power. By proving that developers do not need proprietary frontier models to achieve state-of-the-art results—provided the external memory and instructions are optimized—these advocates believe the industry can avoid vendor lock-in. They argue that open-weight models paired with open-source scaffolding will eventually outpace closed ecosystems in both cost-efficiency and performance.

AI Engineering Pragmatists

Focus on the cost and efficiency benefits of optimizing external scaffolding over fine-tuning model weights.

For pragmatic software engineers, the appeal of SkillOpt and MiMo Code lies in their predictability and low cost. Fine-tuning a billion-parameter model is an expensive, mathematically opaque process that must be repeated whenever a new base model is released. In contrast, optimizing a markdown file or managing context via a SQLite database relies on proven, deterministic software engineering principles. This camp values the ability to version-control AI behaviors just like traditional code.

Frontier Model Developers

Acknowledge the value of scaffolding but maintain that underlying model intelligence dictates the ceiling of agent capabilities.

While recognizing that tools like SkillOpt and MiMo Code extract maximum value from current models, developers at leading AI labs argue that scaffolding is a temporary bridge. They contend that as native context windows expand to millions of tokens and models gain better intrinsic reasoning capabilities, the need for complex external memory databases and hyper-optimized instruction files will naturally diminish. To this camp, scaffolding is complementary, not a replacement for next-generation foundational models.

What we don't know

How effectively these text-optimization and memory-compression techniques will scale to non-coding domains, such as legal analysis or creative writing.
Whether proprietary frontier model developers like OpenAI and Anthropic will integrate similar persistent memory architectures natively into their commercial APIs.
The long-term storage costs and latency implications of maintaining massive SQLite memory databases for thousands of concurrent enterprise AI agents.

Key terms

Agentic Scaffolding: The external software infrastructure—such as memory databases, tool interfaces, and instruction files—built around an AI model to help it execute complex, multi-step tasks.
Context Window: The maximum amount of text an AI model can process and remember at any one time during a single interaction.
Textual Gradient Descent: A conceptual approach used by SkillOpt where an AI systematically proposes and validates small text edits to an instruction file to gradually improve performance.
SWE-Bench Pro: A rigorous industry benchmark that evaluates an AI agent's ability to resolve real-world software engineering issues sourced from GitHub.

Frequently asked

What is Microsoft SkillOpt?

It is an open-source framework that optimizes the text-based instructions (skills) given to AI agents, improving their accuracy without changing the underlying model's weights.

How does Xiaomi MiMo Code prevent AI amnesia?

It uses a background subagent to continuously summarize and store project context in a persistent memory database, allowing the main AI to recall earlier decisions even after the context window fills up.

Do I need a specific AI model to use these tools?

No. Both tools are model-agnostic. SkillOpt works with models like GPT-5.5, Claude, and Qwen, while MiMo Code can connect to backends like DeepSeek, Kimi, and GLM.

Sources

[1]VentureBeatOpen-Source Advocates
Microsoft’s open-source SkillOpt automatically upgrades AI agent skills without touching model weights
Read on VentureBeat →
[2]VentureBeatOpen-Source Advocates
Xiaomi's new open source, agentic AI coding harness MiMo Code beats Claude Code at ultra-long, 200+ step tasks
Read on VentureBeat →
[3]FlowtivityAI Engineering Pragmatists
Microsoft SkillOpt: How to Train AI Agent Skills Like Neural Networks
Read on Flowtivity →
[4]ToKnow.aiAI Engineering Pragmatists
SkillOpt: Microsoft Trains Agent Instructions Instead of Model Weights, Gains +23% Accuracy
Read on ToKnow.ai →
[5]ProPakistaniEnterprise Developers
Xiaomi’s New AI Coding Agent Beats Claude Code and is Now Completely Free to Use
Read on ProPakistani →
[6]GIGAZINEOpen-Source Advocates
Xiaomi has released 'MiMo Code,' an AI agent similar to Claude Code, as open source
Read on GIGAZINE →
[7]explainx.aiAI Engineering Pragmatists
Microsoft SkillOpt: Self-Improving Agent Skills Guide 2026
Read on explainx.ai →
[8]BlogNTOpen-Source Advocates
Xiaomi MiMo Code : l'assistant IA gratuit et open source
Read on BlogNT →

Up next

AI Interpretability

Mapping the AI Mind: How Sparse Autoencoders Are Solving the Black Box Problem

Researchers at Anthropic and OpenAI have achieved major breakthroughs in 'mechanistic interpretability,' using sparse autoencoders to map millions of human-understandable concepts inside frontier AI models.

Stay informed

Every angle. Every day.

Get technology stories with full source coverage and perspective breakdowns delivered to your inbox.

Get the briefing →Browse technology