MicroVMs: Run isolated sandboxes with full lifecycle control

Infrastructure
Cloud
AI
Developer Tools
Security

AWS is turning the Firecracker machinery behind Lambda into a standalone service called Lambda MicroVMs. You build an image from a Dockerfile and code, then launch short-lived VMs with suspend, resume, snapshots, and per-instance billing. The pitch is straightforward: if your product needs to run code you did not write, and you want a stronger boundary than containers, AWS will now sell that isolation directly for things like coding agents, interactive dev environments, vulnerability scanning, analytics, and scripted game servers.

Treat this as a low-level cloud primitive for short-lived, per-user isolation, not as a general replacement for containers or long-lived VMs. If you run agentic or user-code workloads, compare it directly against self-hosted Firecracker, Fly, Railway, E2B, and plain EC2 on startup latency, billing predictability, and whether suspend-resume plus snapshots actually saves enough orchestration work to justify AWS pricing.

June 26, 2026
aws.amazon.com
Discuss on HN

Key insights

Subagents make startup latency add up

The agent use case gets more concrete when you stop imagining one VM per chat session and start imagining one VM per tool call or subagent. In that model, a single user session can fan out into many isolated workers with separate credentials and short bursts of code execution. Fast boot and lower baseline memory stop being nice-to-haves and become the difference between a manageable system and one that burns hours waiting for full VMs to start.

If your agent architecture uses tool isolation, credential partitioning, or parallel subagents, model VM count per user action before dismissing boot time as irrelevant. A design that looks fine with one long-lived VM can collapse once every session fans out into dozens of short-lived workers.

Attribution:

0xbadcafebee #1
nok22kon #1
vmg12 #1
victorbjorklund #1

The hard part has shifted above the hypervisor

gVisor, libkrun, Firecracker, and QEMU all look workable now. The friction is in the missing platform details around them. Comments dug into rootless networking with passt, graphical app support, Wayland and Vulkan passthrough, overlay filesystems, terminal quirks, and audio transport. That makes the current market easier to read. The VMM is no longer the moat. The moat is packaging all the ugly integration work into something developers can actually operate.

When evaluating sandbox stacks, spend less time benchmarking raw boot speed and more time checking the edges you will trip over in production. Networking, filesystem layering, GUI support, and rootless operation will dominate build time if your product goes beyond trivial code execution.

Attribution:

coppsilgold #1 #2
eperot #1 #2

Repository state is the real payload

For coding agents, source trees quickly dominate the cost profile. One comment building on AgentCore said large Git repositories bloat MicroVMs fast, and another pointed out that EFS performs badly for build-style workloads with many small random reads and writes. The useful idea here was to keep the VM filesystem minimal and move toward lazy mounting or other strategies that avoid baking multiple gigabytes of source into snapshots. Snapshot restore only stays fast if the snapshot stays small.

If you plan to run coding agents, design storage before you choose the sandbox product. Test lazy checkout, prebuilt snapshots, and mount strategies against real repositories, because filesystem shape will decide cold start and restore times more than VM boot speed will.

Attribution:

mohsen1 #1 #2
mdeeks #1

The 8 hour cap hides a cost signal

The runtime limit is not just a product quirk. It lines up with the pricing and memory behavior that make this service unattractive as a long-lived dev box. One calculation put the break-even against exe.dev at roughly 1.65 days of continuous runtime for an 8 GB allocation, longer if you can run smaller. Another comment said a 16 vCPU and 32 GB shape works out to about four times the cost of comparable on-demand EC2. The service wants bursty usage, not persistent occupancy.

Run the math in hours of active runtime, not monthly sticker price. If an environment is busy for days at a time, move it to EC2, Fly, or another fixed-cost box and reserve MicroVMs for sessions that really sleep, fan out, or scale to zero.

Attribution:

topspin #1
otterley #1

Fargate is the wrong comparison for hostile code

The most useful defense of AWS’s positioning was that Fargate solves a different trust problem. If you are running your own code, container isolation on managed infrastructure is often good enough. If you are letting end users or agents execute arbitrary code, the desired boundary shifts to per-session VM isolation with faster pause and restore semantics. That does not make the product fully differentiated, but it does explain why AWS would expose a VM-shaped primitive instead of telling everyone to use Fargate.

Frame the choice around trust boundaries, not just packaging. If tenants can execute code you would call malicious by default, compare products on per-session isolation guarantees and recovery semantics, not on whether both accept Dockerfiles.

Attribution:

simonreiff #1
0xbadcafebee #1

Against the grain

Sandbox startups may still beat AWS on economics

The dismissive take that sandbox startups are just expensive wrappers got pushed back by the claim that major providers like Modal run on their own bare metal and can match or beat hyperscaler pricing. That does not answer the trust or durability gap, but it weakens the simple story that AWS will automatically underprice everyone by exposing Firecracker directly.

Do not assume AWS is the cost floor. Ask where a vendor runs, whether they own capacity, and how idle-heavy your workload is before deciding that a managed sandbox startup must be overpriced.

Attribution:

jacobgold #1 #2

The 8 hour limit may be technical debt

One commenter claimed the 8 hour ceiling is there because Firecracker does not reclaim RAM well and long-lived instances eventually bloat. If true, the time limit is not just product segmentation. It is a guardrail around an implementation weakness that matters if you hoped to stretch these into durable environments.

Treat the runtime cap as a real engineering constraint, not something AWS will casually raise. Load-test memory growth over repeated suspend and resume cycles before building a product that depends on near-persistent sessions.

Attribution:

fizx #1

QEMU microVMs and libvirt may be enough

For smaller teams, the argument against chasing Firecracker was pragmatic. QEMU microVMs via libvirt give up some startup speed, but they are easier to operate, more flexible, and may even place more densely at scale despite higher per-VM memory overhead. That is a reminder that the best answer for self-hosting is often the toolchain you can already run well, not the lightest possible VMM.

If you are self-hosting and do not need extreme serverless-style startup behavior, prototype with libvirt and QEMU first. Operational familiarity can beat theoretical efficiency, especially for mixed workloads and longer-lived VMs.

Attribution:

ilaksh #1
nyrikki #1

In plain english

EC2 ↩

Amazon Elastic Compute Cloud, AWS’s general-purpose virtual machine service.

EFS ↩

Amazon Elastic File System, AWS’s managed network file storage service.

Fargate ↩

AWS’s managed service for running containers without managing servers directly.

Firecracker ↩

An open source virtual machine monitor created by AWS to run lightweight virtual machines for services like Lambda.

gVisor ↩

A Google-built sandbox runtime that intercepts system calls and provides an extra isolation layer for applications, especially containers.

libkrun ↩

A library for running lightweight virtual machines, often used to isolate workloads with a VM boundary.

libvirt ↩

A common open source toolkit and API for managing virtual machines and related infrastructure.

passt ↩

A userspace networking tool that helps provide network access to isolated or rootless virtualized workloads.

QEMU ↩

A widely used open source machine emulator and virtualizer for running virtual machines.

VMM ↩

Virtual Machine Monitor, another name for the software layer that creates and runs virtual machines.

Vulkan ↩

A low-level graphics and compute API used for high-performance rendering and GPU work.

Wayland ↩

A modern Linux display system that replaces older graphics session software such as X11 in many desktop environments.

Reference links

Sandbox providers and platforms

E2B
Referenced as a higher-level sandbox platform compared with AWS’s lower-level compute primitive.
shellbox.dev
Presented as an alternative service for longer-lived SSH-managed Linux boxes with suspend and resume behavior.
Railway Sandboxes documentation
Cited as another sandbox product with pay-per-utilized-resources pricing for agent workloads.
SlicerVM
Mentioned as a self-hosted microVM-oriented offering built for alternative hosting setups.

Open source runtimes and self-hosting tools

amika roadmap
Shared as open-core work for coordinating local and remote sandboxes across providers.
Flue Framework
Mentioned as another framework in the sandbox orchestration space.
microsandbox.dev
Referenced as a friendly layer on top of libkrun for local sandboxed apps and agents.
smolvm
Pointed to for local microVMs and upcoming hosted GPU-backed support.
QEMU microvm documentation
Linked as a practical self-hosting alternative to Firecracker.
pve-microvm for Proxmox
Shared as a recent project for using microVMs inside Proxmox.

AWS references and related architecture

AWS Bedrock AgentCore Runtime docs
Used to note that AWS already had a related agent runtime using similar MicroVM technology.
AWS Lambda pricing
Linked to the pricing tab for MicroVMs during discussion of cost complexity and break-even points.
AWS Firecracker launch post
Cited in debate over whether Fargate uses Firecracker and how AWS positions these services.
Recent HN story on nested virtualization on AWS
Referenced to support the claim that newer AWS instances can run nested virtualization.

Performance, pricing, and implementation writeups

Podman, Wasm, and libkrun performance notes
Linked to illustrate compute and memory overhead concerns for libkrun-style isolation.
Ramp on building a background agent
Referenced in discussion of prebuilding snapshots with the latest repository checkout.
Justin Garrison on Fargate and Firecracker
Used as a counterpoint in the debate over Fargate’s implementation and launch speed.
Fly.io accident forgiveness policy
Shared to support Fly’s claim of more user-friendly billing behavior than hyperscaler-style surprise charges.

MicroVMs: Run isolated sandboxes with full lifecycle control

Discussion mood

Key insights

Against the grain

In plain english

Reference links

Sandbox providers and platforms

Open source runtimes and self-hosting tools

AWS references and related architecture

Performance, pricing, and implementation writeups