Tidying in a Tokyo kitchen

Get the exact training data your robot needs, within 4 weeks.

The on-demand workforce for real-world human data. Name the task, the person, and the place, and we deliver rights-cleared video plus synced motion, built to your spec.

Request a dataset Book a call

Real-world human data, captured to your spec.

Tokyo

Stockholm

Dubai

New York

800k contributors · 50 countries

Trusted by teams building embodied AI

Northwind Robotics

Cadence AI

Meridian Labs

Vantapoint

Helio Robotics

Praxis Embodied

Sundial AI

Forge Dynamics

0k+

contributors activated

countries covered

~4 wks

custom delivery

Explore by room & task

The same task, captured across the whole world.

Maximum place and person diversity is what makes models generalise. Switch a room or task to see the breadth.

By room

By task

Tokyo, Japan

Counters, stovetop, appliances and cupboards — the densest manipulation space in the home.

🇯🇵 Tokyo · captured to spec

View kitchen datasets

The problem

The data physical AI needs was never online.

Robots and embodied agents cannot learn from web text. They need recordings of real people doing real tasks, with the physical motion attached. That data was never online. It has to be captured on purpose. The bottleneck is supply.

The hard part is getting the right person, in the right place, to capture the right thing, cleared for commercial use, reliably and at scale.

Our approach

A managed, on-demand workforce.

Rather than waiting for the right contributor to upload the right thing, we source and dispatch the exact people a dataset needs.

Marketplace / upload

“A listing waits for the right contributor to find it.”

Reactive: best suited to data that already exists in the crowd.
Less suited to specific, rare, professional, or location-bound needs.
Coverage depends on who is already in the pool.

Motionstack

“Name the task and the person and the place. We field them and deliver the dataset.”

Proactive: we source and dispatch the exact people, at volume.
Targeted by task, profession, demographic, and geography.
Sourced and dispatched to spec, drawing on a global network.

How it works

Three steps from spec to dataset.

📞01

Tell us the place and task

A task, a person type, an environment, a location, a modality, a volume, a quality bar. We turn it into a machine-readable rubric.

🎥02

Our people film it to spec

We field matching contributors and they capture on a standardised rig, so every clip is consistent and comparable.

📅03

You get it in about four weeks

A loadable LeRobot dataset: media, per-frame motion, labels, consent, provenance. Consented, owned, and cleared.

More than footage

What's in every clip.

Video by default, with per-frame motion plus optional depth, 3D hand pose, and object masks. Raw or labelled, your choice. Always with the paperwork.

video.mp4

Real people, filmed to a standardised rig.

motion.json

Camera ego-motion every frame, plus optional 3D hand pose.

labels.json

Action spans, language instructions, one taxonomy.

consent.pdf

Signed release. Owned and audit-ready.

meta/

Schema, episodes, tasks, diversity stats.

The format

Ships as native LeRobot. Auditable end to end.

Every delivery is a loadable LeRobot dataset, the format robotics labs already ingest: per-frame motion and labels in parquet, egocentric video, and language instructions, with optional depth, hand pose, and object masks. Consent, a datasheet, and a full model-and-cost record travel with it. Pull it to your storage, or via API.

See how delivery works

loadable LeRobot dataset

lerobot-dataset/

meta/# schema · episodes · tasks · diversity stats

data/chunk-000/# per-frame parquet: state, action, labels

videos/chunk-000/# the synced video streams

observation.images.head# egocentric RGB · 1080p60

observation.depth.head# depth maps (optional)

observation.masks.head# object masks (optional)

consent/# signed release · chain-of-title

processing.json# every model · time · cost per hour

DATASHEET · LICENSE# datasheet, license, readme

Native LeRobot. RLDS and LeRobot v3 export on request.

Coverage

50+ countries, more every week.

Diversity of place and person is the product. Here is some of where we field today.

AustraliaAustriaBelgiumCanadaChinaDenmarkFinlandFranceGermanyIrelandItalyJapanLuxembourgNetherlandsNew ZealandNorwayPolandPortugalSingaporeSouth KoreaSpainSwedenSwitzerlandUnited Arab EmiratesUnited KingdomUnited States

See full coverage

Licensing

We own the data and license it to you.

Pricing is per QA'd hour, with an exclusivity multiplier. Choose the tier that fits the run.

Baseline

Non-exclusive catalog

The default, and the lowest price. Re-licensable, so it ships now if it already exists.

Time-boxed exclusive

Exclusive use for 6 to 12 months, then it joins the catalog.

Top

Full exclusive buyout

We forgo all future re-licensing. The top tier.

See pricing

Get the real-world data your robot needs.

Tell us the task, the person, and the place. We field it from a network of 800k contributors and deliver it to spec, cleared for commercial training, in about four weeks.

Book a call Request a dataset