Rlhf_safety_guarantees
I’m pleased to announce that our paper Reinforcement Learning from Human Feedback with High-Confidence Safety Guarantees has been accepted for publication at RLC 2025!
I’m pleased to announce that our paper Reinforcement Learning from Human Feedback with High-Confidence Safety Guarantees has been accepted for publication at RLC 2025!