Paper List

Tag: jailbreak

1 item with this tag.

  • May 02, 2026

    Fine-tuning Safety: Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!

    • fine_tuning_safety
    • safety_degradation
    • jailbreak

Created with Quartz v4.5.1 © 2026

  • GitHub