Review Guide

Everything you need to know to score robot behavior.

Context

We are training a humanoid robot to follow instructions like a good human worker. The dataset consists of head-mounted camera videos of the humanoid doing various tasks — folding clothes, picking up objects, cleaning the kitchen, and many others. Each video contains many instructions, with at most one instruction active at any time.

The robot will learn to maximize its score, so your scores directly shape its behavior.

What you're scoring

Each episode is segmented into labels — each label has a start frame, end frame, and a language instruction (e.g. "pick up the red block").

Pretend you gave the instruction to the robot to accomplish some goal. Your job is to score from 0 to 5 how much you like the robot's behavior while that instruction is active. Think of it like a star review — how satisfied are you with how the robot executed the task?

Scoring rubric

ScoreMeaningDescription
0 Catastrophic The robot endangers or injures a human, destroys something very valuable (e.g. a TV), or creates a dangerous situation (e.g. starts a fire)
1 Very bad Highly unsatisfied. The robot doesn't follow the instruction at all, breaks or destroys something, or hurts itself
2 Bad The robot attempts the task but does it poorly — wrong object, clumsy execution, or mostly fails to accomplish the goal
3 Ok The robot makes a reasonable attempt. The task is partially accomplished but with noticeable issues or inefficiency
4 Good The robot accomplishes the task well. Minor imperfections but you'd be satisfied with the performance
5 Perfect You can't find anything to criticize. The robot does exactly as you wish and behaves like an expert human or even superhuman

Workflow

1Go to the Queue tab and click Review on an episode.
2The viewer opens on the first unreviewed label. Press Space to play.
3Watch the segment. Before scoring, check if the instruction and boundaries make sense (see below).
4Fix the instruction text and/or start/end frames if needed.
5Press a number key 0–5 to score the robot's behavior. The viewer auto-advances to the next unreviewed label.
6Repeat until all labels are reviewed. Your progress shows in the counter at the top.

Step 1: Check instruction coherence

Before scoring, ask yourself: does this instruction make sense in this situation? Could you reasonably give this instruction to the robot given what's in the scene?

If the instruction makes sense, don't edit it — just score the behavior. A robot that fails to follow a perfectly good instruction should get a low score, not an edited instruction.

For example, the instruction is "pick up the hammer":

Wrong boundaries — Check that the start and end frames match the task. For example, the instruction is "pick up the hammer" and the robot picks it up but then also walks to the kitchen — the end frame should be right after the robot finishes picking up the hammer. Use i to set the start frame and o to set the end frame to the current video position.

Always fix instruction and boundaries first, then score the corrected segment.

Step 2: Score the behavior

Once the instruction and boundaries are correct, score how well the robot executed the task (press 0–5). See the scoring rubric above.

Editing shortcuts

/Edit the instruction text
iSet start frame to current video position
oSet end frame to current video position
uUndo your last structural edit

Edits create a new version of the label (the original is preserved).

Keyboard shortcuts

Space / kPlay / pause
0 - 5Score the selected label
n / TabNext unreviewed label
pPrevious unreviewed label
bSeek to start of current label
aToggle auto-pause at label end
h / lStep back / forward 1 frame
H / LStep back / forward 1 second
i / oSet start / end frame
/Edit instruction text
uUndo last edit
?Show shortcut overlay in viewer

Examples

Open these example episodes to see reviewed labels with scoring notes explaining why each score was given. Click any label in the timeline to see its notes in the side panel.

No example episodes configured yet. An admin can add them from the Admin panel.