[2606.03648] Safety Measurements for Fine-tuned LLMs Should be Grounded in Capability
AI disclosure
Summary
Abstract page for arXiv paper 2606.03648: Safety Measurements for Fine-tuned LLMs Should be Grounded in Capability