A practical simulation workflow for building and debugging robot manipulation policies with augmented rollout videos, joint logs, VLM transcripts, and task analyzers.
Policy iteration loopPhysics-accurate simulationDynamic feedback signal engineeringIK and Vision services
Development loop
High-level workflow
01
Inputs
Define intent in a task YAML spec. <task>.yaml under policies/impl/<task>/
02
Design & implement
Coding model implements policy, analyzers, and unit tests for the task.
03
Simulate
Produces rollout.mp4, joints.csv, metrics, VLM transcript, and custom analyzer outputs.
04
Redesign
Inspect video frames, logs, and transcripts. Run ad hoc test and simulation. Design fixes.
Loop back to design & implement
Agent Foundations
Why this workflow scales
01Solid abstraction layer
IK and Vision services hide low-level math, geometry, and camera plumbing
The development agent can operate at policy/task intent level instead of raw simulator internals
Shared services keep implementations consistent across tasks
Policy files stay focused on behavior design rather than infrastructure code
02Rich feedback signal
rollout.mp4 includes 4-view video with kinematic overlays
joints.csv captures target vs applied controls at each step
Custom analyzers are dynamically coded for each task and each iteration, converting trajectories into structured diagnostics
Together, these artifacts make failure modes explicit and debuggable
03Hypothesis-driven redesign
During Redesign, the development agent can run multiple ad hoc tests per hypothesis
Tests validate root-cause assumptions before policy changes are finalized
Fast test loops reduce guesswork and prevent overfitting to one rollout
Focused regression checks help preserve fixes across iterations
Example rollouts by use case
Policy in action
Shoulder Rotation
Primary goal: rotate around the shoulder-lift hinge while keeping a controlled TCP
arc and stable gripper state using explicit arc targets + IK from
policies/impl/shoulder_rotation/shoulder_rotation.yaml.
Primary goal: trace the 12 edges of a virtual cube in the air with the gripper tip.
Uses a two-phase precomputed IK trajectory (approach + connected 16-segment cube-edge trace) from
policies/impl/path_tracing/path_tracing.yaml.
Primary goal: pick the orange box from above and place it at a different table
location. A vision-guided policy uses topdown color detection and an 8-phase IK plan
(crane → approach → descend → grip → lift → transport → lower → release) from
policies/impl/box_pick/box_pick.yaml.
Iteration 1: generated policy with control/physics issue
Primary goal: hover the gripper tip over detected top-edge segments of a fixed
mounting bracket and execute a smooth inspection scan trajectory from
policies/impl/inspection/inspection.yaml.
Primary goal: precisely grasp the cable tip and route it through sequential peg gates
using high-resolution top-camera perception, staged IK motion, and iterative policy validation from
policies/impl/cable_routing/cable_routing.yaml.