PhysDB - Safety evaluation

What it is

Safety evaluation links model behavior to physical risk, not just task completion.

A robot can fail by doing the wrong action confidently, too late, too fast, or in the wrong place.

PhysDB does not certify safety; it maps where safety evidence would need to live.

requires

Physical deployment

No page should imply safety certification.

needs

Real-world validation

Simulation success is not safety clearance.

extends

Safety-relevant evaluation

Safety evaluation needs failure modes, not only success scores.