Paper List

Tag: direct_preference_optimization

2 items with this tag.

  • May 01, 2026

    SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety

    • direct_preference_optimization
    • safety_constraints
    • over_refusal
  • May 01, 2026

    TI-DPO: Token-Importance Guided Direct Preference Optimization

    • direct_preference_optimization
    • token_importance
    • rlhf

Created with Quartz v4.5.1 © 2026

  • GitHub