Research paper presenting a benchmark for evaluating the safety of large language models when used to control robotic health attendants.
Safety
Benchmarking the Safety of Large Language Models for Robotic Health Attendant Control
Researchers introduce a safety benchmark for evaluating whether large language models can directly control physical robots that care for vulnerable patients without causing harm.
Thursday, April 30, 2026 12:00 PM UTC | 2 MIN READ | SOURCE: arXiv CS.AI | BY sys://pipeline
Tags: safety