Proposal · Project Swan

SeedSpatialcodename

In-headset, no-code creation of spatial apps and interfaces for Project Swan.

Build your own spatial workspace with your hands and voice.
No code·No PC·No install.

A person in a mixed-reality headset points at a floating glass panel that is resizing in response.
Point at an object, say “make this bigger.”
What it is
Your workspace, your vibe

Not a fixed set of apps handed to you. Design the workspace you actually want: the widgets and windows you reach for, arranged around you in 3D space and anchored where they belong.

A person relaxes on a couch in a mixed-reality headset, surrounded by their own spatial workspace: floating 2D windows with charts and a dashboard, music and video player panels, and colorful 3D widgets anchored around the room and on a shelf, one highlighted as they edit it by reaching out.
Your own workspace: 2D windows and 3D widgets, charts, music and video players, arranged around you and anchored where you want them.
Why now
The insight: point, don't prompt
Gesture grammar
  • Point + speak: edit one object.
  • Two-hand pinch: frame a region, edit many at once.
  • Point to point: “put this there,” “make this like that.”
  • Describe: create new objects, interfaces, and widgets. The one step you command the PICO OS AI companion to do.
Under the hood
  • A persistent semantic scene graph is both the live state and the final app.
  • Edits are validated patches, not regenerated code. Reversible and coherent.
  • Common edits run locally and instantly; only the creative step calls out.
  • That step is a scoped command to the PICO OS AI companion, handed the right spatial context from the scene graph.
  • In-ecosystem and server-side. The headset is the only client.
Four gestures: point to edit one object, two-hand pinch to frame a region, point-to-point to move, and an open hand to grow a new object.
The grammar, left to right: point + speak, two-hand frame, point to point, describe to create.
How it runs on PICO Spatial OS

Not a separate engine bolted onto the headset. The editor is a web app on PICO OS 6's immersive WebXR runtime. It draws input from the OS sensor stack, and the apps it builds run on that same runtime. Authoring and playback are the same layer.

OutputPublished app The scene graph serialized into an immersive WebXR shell. The OS runtime opens it from a link or QR: no store, no install, no separate build.
The toolSeedSpatial editor A web app launched from the OS like any Swan app. Runs entirely in-headset: no PC, no sideload, no dev mode.
RuntimeImmersive WebXR · three.js One WebXR runtime both authors the scene and plays it back, so what you edit is the app. No export step swaps engines. ByteDance's own WebSpatial drives the editor's 2D control panels.
InputOS sensor layer Swan's scene understanding plus eye-and-hand “look and pinch.” These OS services resolve pointing in hardware, the source of the zero-ambiguity targeting.
IntelligencePICO OS AI companion The creative step is a command to the OS's own AI companion, fed the right spatial context from the scene graph. SeedSpatial brings no separate model; it speaks to the assistant the platform already ships.
PlatformPICO Project Swan / OS 6 The headset is the only client. Auth, persistence, model calls and the deterministic publish factory run server-side.
SeedSpatial vs Google XR Blocks
SeedSpatial Google XR Blocks
How you edit Point and speak; frame a region with two hands Type or speak a prompt in Gemini chat
Selecting what to change Hand ray + gaze, resolved in the sensor layer The model infers it from your description, taking long time
Edit model Targeted patches on a persistent scene graph Regenerates the whole app each prompt
Iterating Incremental by design Limited; named as future work in their own paper
Common-edit latency Local on the headset, instant Cloud round-trip every time
Publishing Deterministic export to an instant WebXR link Re-generated WebXR build
Platform PICO Project Swan / OS 6 Android XR / Galaxy XR
Built for Non-technical people making and sharing apps Developers and designers prototyping
Why Project Swan
  • Scene understanding plus eye-and-hand “look and pinch” is the substrate pointing needs.
  • Web-native stack: immersive WebXR for the scene, ByteDance's own WebSpatial for spatial UI.
  • The PICO OS AI companion is already there to do the creative step in-ecosystem.
  • Aligns with the OS vision: less cognitive load by building your own workspace, not juggling fixed apps.
  • A flagship launch that needs UGC on day one.
Honest scope
  • For consumer creation: personalization, mini-experiences, social UGC.
  • Not a Unity replacement. Production XR still needs engineers (for now!).