Skip to yearly menu bar Skip to main content


Poster
in
Affinity Workshop: Tiny Papers Poster Session 8

A Bi-Objective ε-Constrained Framework for Quality-Cost Optimization in Language Model Ensembles

Aditya Singh · Aditi Singla · Kanishk Kukreja

#267

Abstract:

We propose an ensembling framework that uses diverse open-sourced Large Language Models (LLMs) to achieve high response quality while maintaining cost efficiency. We formulate a bi-objective optimization problem to represent the quality-cost tradeoff and then introduce an additional budget constraint that reduces the problem to a straightforward 0/1 knapsack problem. We empirically demonstrate that our framework outperforms the existing ensembling approaches in response quality while significantly reducing costs.

Live content is unavailable. Log in and register to view live content