Paper List
Search
Search
Dark mode
Light mode
Explorer
Tag: direct_preference_optimization
2 items with this tag.
May 01, 2026
SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety
direct_preference_optimization
safety_constraints
over_refusal
May 01, 2026
TI-DPO: Token-Importance Guided Direct Preference Optimization
direct_preference_optimization
token_importance
rlhf