Project Overview
The AutoLLMSelect project aims to:
1. Publish a comprehensive analysis of LLM benchmark datasets to facilitate robust and unbiased LLM benchmarking.
2. Take the first steps towards a robust, explainable, and evolving framework for automated LLM selection, based on a multi-disciplinary approach, that would reduce the cost of comparing a large LLM portfolio on ML datasets.
3. Evaluate the applicability of the framework on a use case from the field of sustainable development.
Due to the high complexity of the problem, the project will present a proof of concept on a selected LLM portfolio, dataset portfolio, and set of performance metrics, based on the data available in public benchmarks. The framework is intended to evolve and could be extended in the future with new LLMs, benchmark datasets, ML tasks, and performance metrics, both by the project team and by the community.
The project lasts two years and will be coordinated by the Jožef Stefan Institute (JSI) in Ljubljana, Slovenia, where it will be hosted by the Computer Systems Department. A two-month research stay will take place at the Machine Learning Lab, Department of Computer Science, University of Freiburg, Germany.
