firstbacksecondback
674 Results
Workshop
|
Self-Alignment of Large Language Models via Social Scene Simulation Xianghe Pang · Shuo Tang · Rui Ye · Yuxin Xiong · Bolun Zhang · Yanfeng Wang · Siheng Chen |
||
Workshop
|
Temperature-scaling surprisal estimates improve fit to human reading times – But does it do so for the "right reasons"? Tong Liu · Iza Škrjanec · Vera Demberg |
||
Poster
|
Tue 7:30 |
Unmasking and Improving Data Credibility: A Study with Datasets for Training Harmless Language Models Zhaowei Zhu · Jialu Wang · Hao Cheng · Yang Liu |
|
Poster
|
Thu 1:45 |
RECOMP: Improving Retrieval-Augmented LMs with Context Compression and Selective Augmentation Fangyuan Xu · Weijia Shi · Eunsol Choi |
|
Workshop
|
Sat 2:40 |
Fine-tuning Protein Language Models with Deep Mutational Scanning improves Variant Effect Prediction Aleix Lafita · Ferran Gonzalez · Mahmoud Hossam · Paul Smyth · Jacob Deasy · Ari Allyn-Feuer · Daniel Seaton · Stephen Young |
|
Workshop
|
Towards Natural Language-Driven Industrial Assembly Using Foundation Models Omkar Joglekar · Shir Kozlovsky · Tal Lancewicki · Vladimir Tchuiev · Zohar Feldman · Dotan Di Castro |
||
Workshop
|
Conditional Diffusion Models as Self-supervised Learning Backbone for Irregular Time Series Hamed Shirzad · Ruizhi Deng · He Zhao · Frederick Tung |
||
Workshop
|
On Different Faces of Model Scaling in Supervised and Self-Supervised Learning Matteo Gamba · Arna Ghosh · Kumar Agrawal · Blake A Richards · Hossein Azizpour · Mårten Björkman |
||
Workshop
|
GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models Haibo Jin · Ruoxi Chen · Andy Zhou · Yang Zhang · Haohan Wang |
||
Workshop
|
GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models Haibo Jin · Ruoxi Chen · Andy Zhou · Yang Zhang · Haohan Wang |
||
Poster
|
Tue 7:30 |
Large Language Models Cannot Self-Correct Reasoning Yet Jie Huang · Xinyun Chen · Swaroop Mishra · Huaixiu Steven Zheng · Adams Yu · Xinying Song · Denny Zhou |
|
Poster
|
Wed 1:45 |
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning Rui Zheng · Wei Shen · Yuan Hua · Wenbin Lai · Shihan Dou · Yuhao Zhou · Zhiheng Xi · Xiao Wang · Haoran Huang · Tao Gui · Qi Zhang · Xuanjing Huang |