firstbacksecondback
3 Results
Workshop
|
Peering Through Preferences: Unraveling Feedback Acquisition for Aligning Large Language Models Hritik Bansal · John Dang · Aditya Grover |
||
Workshop
|
West-of-N: Synthetic Preference Generation for Improved Reward Modeling Alizée Pace · Jonathan Mallinson · Eric Malmi · Sebastian Krause · Aliaksei Severyn |
||
Workshop
|
Feedback-guided Data Synthesis for Imbalanced Classification Reyhane Askari Hemmat · Mohammad Pezeshki · Florian Bordes · Michal Drozdzal · Adriana Romero-Soriano |