The Physical AI Data Foundry

Approach

Not by hand.

But automated.

Today, most robotics data is collected through teleoperation — a human operates a robot, one task at a time. It's slow, expensive, and impossible to scale. And none of it happens in kitchens.

The Lili-o foundry runs household tasks autonomously across 50+ purpose-built home environments — kitchens, living rooms, bathrooms, laundry rooms — robots operate 24/7, retry on failure, and generate episodes continuously. No human operator. No wage floor. No ceiling on scale.

KitchenLiving roomBathroomLaundry roomBedroom
24/7
operation
50+
home environments
0
operators
Lili-o foundry environments
Comparison

The market settled for trade-offs. We didn't.

SimulationHuman-CentricTéléopérationLili-o
Rich MetadataLowMediumHighHigh
Environment DiversityHighHighLowHigh
PriceMediumLowHighHigh
Cross-embodimentNoYesNoYes
ScalableHighMediumLowHigh
CompaniesLightwheel · NVIDIAScale · SenseiroboticTutor · Figure · AgibotLili-o

*EU AI Act compliant

How it works

One use case. Deployed everywhere. Running forever.

Each use case is a self-contained, hardware-agnostic program that runs on any compatible robot without modification. Build it once — deploy it across the entire fleet.

01

Hardware agnostic

One use case, any robot. Unitree, Rainbow, Agibot — no reprogramming.

02

One-Shot learning

5-minute demonstration → autonomous skill. Industry standard: 1 hour+.

Learn how the One-Shot method works →
03

Perpetual output

One build. Infinite episodes. 24/7 generation across the entire fleet.

The data

The modalities that actually matter.

Force-torque and proprioceptive signals are absent from almost all existing public datasets. This is the gap that separates models that look like they're manipulating from models that can actually feel it.

01

RGB Video

Multi-view synchronized capture

02

Depth Maps

Aligned to RGB at every frame

03

Tactile / Force-Torque

The signal almost no dataset has

04

Joint Encoders

Full proprioceptive state

05

Episode Metadata

Auto-labeled, structured, ready-to-train

5 modalities. Synchronized. Auto-labeled. Cross-embodiment compatible. EU AI Act compliant.

Second channel

Real homes. Real people. Real tasks.

Our second collection channel sends instrumented participants into their own homes — kitchens, bathrooms, laundry rooms — wearing RGB-D cameras and haptic gloves, performing everyday household tasks as they naturally would.

This captures the environmental chaos, behavioral variance, and physical interaction that a controlled environment can never replicate. The mess on the counter. The wet dish. The awkward cabinet angle.

RGB-DTactile / hapticDiverse home layoutsNatural behaviorEU AI Act ✓
Kitchen
  • Dish washing
  • Meal prep
  • Appliance use
  • Counter cleaning
Living room
  • Object sorting
  • Table setting
  • Tidying
  • Vacuuming
Laundry room
  • Folding clothes
  • Loading washer
  • Ironing
  • Sorting laundry
Bathroom
  • Surface wiping
  • Bin handling
  • Towel folding
  • Cleaning fixtures

Ready to train on the data Physical AI has been missing?

Request access