Refactor LLM Configuration to YAML-Based System by LeonWehrhahn · Pull Request #386 · ls1intum/Athena

LeonWehrhahn · 2025-01-03T11:47:25Z

Motivation and Context

This PR rewrites the llm_core module configuration system to address current limitations. The core motivation behind these changes is threefold:

Granular LLM Model Selection for Tasks: We need the ability to specify different LLM models for different tasks. For example, using a high-powered, but potentially more costly, LLM model for low-volume complex operations like generating initial structured grading instructions, while employing a faster, more economical LLM model for high-volume tasks like generating feedback on individual student submissions.
Flexible and Comprehensive LLM Model Configuration: We need the ability to configure not only the LLM model to use but also its inherent capabilities (e.g., whether it supports function calling or structured output) and default settings (e.g., temperature, top_p). This is crucial for supporting a diverse range of LLM models.
Preserved Dynamic Configuration Overrides via Headers: While not a new feature, we want to retain the existing ability to dynamically override LLM model configurations via x- headers in API requests, as used in the Athena playground.

Description

To achieve the outlined goals we introduced two YAML files to manage model configurations and capabilities:

llm_capabilities.yml (llm_core): This file defines the core capabilities of different LLM models. It specifies default settings (like temperature, top_p) and flags for supported features (like supports_function_calling, supports_structured_output). Importantly, it also allows for LLM model-specific overrides to these defaults. This file resides at the top level of the llm_core directory and is therefore the same fore ach module - (e.g., module_modeling_llm, module_programming_llm).
llm_config.yml (module-specific): Each module (e.g., module_modeling_llm, module_programming_llm) now has its own llm_config.yml located at the root level of the module. This file specifies the concrete models to be used for different tasks within that module. For example, the modeling module might specify a powerful model like openai_o1 for generating grading instructions and a faster, more economical model like openai_4o for generating feedback. Switching from environment variables to module-level YAML files for LLM configuration brings these settings under version control, ensuring consistent deployments and eliminating the risk of environment-specific discrepancies.

A lot of other aspects of the llm_module were changed to support this new YAML-based configuration approach. These changes are outlined in more detail in the README.

Steps for Testing

Verify Model Configuration:
- Ensure that the llm_config.yml and llm_capabilities.yml files are correctly parsed.
- Check that different modules (e.g., module_modeling_llm) can successfully load and use their specified model configurations.
Test Feedback Generation:
- Test if the Feedback Generation is still working in each module (e.g., module_modeling_llm, module_programming_llm).

Testserver States

Note

These badges show the state of the test servers.
Green = Currently available, Red = Currently locked
Click on the badges to get to the test servers.

Screenshots

…apping and uniqueness resolution

…tum/Athena into feature/modeling/reference

…ion logic

…tum/Athena into feature/modeling/reference

…n feedback model conversion

…tionships; fix foreign key references and ensure proper inheritance structure.

…remove debug prints, update caching logic, and change serialization method for structured grading instructions

…d usage guidelines

github-actions · 2025-01-31T20:15:22Z

⚠️ Unable to deploy to test servers ⚠️

The docker build needs to run through before deploying.

…tions

github-actions · 2025-01-31T21:01:52Z

⚠️ Unable to deploy to test servers ⚠️

The docker build needs to run through before deploying.

LeonWehrhahn and others added 23 commits November 16, 2024 19:30

Refactor feedback reference

f510596

Merge branch 'develop' into feature/modeling/reference

ba85feb

Enhance Apollon JSON transformer and parser for improved element ID m…

6b98bb8

…apping and uniqueness resolution

Merge branch 'feature/modeling/reference' of https://github.com/ls1in…

2bba32f

…tum/Athena into feature/modeling/reference

Merge branch 'develop' into feature/modeling/reference

ac31664

Merge branch 'develop' into feature/modeling/reference

bf5a80d

Add element_ids field to ModelingFeedback and update feedback convers…

9de3f10

…ion logic

Merge branch 'feature/modeling/reference' of https://github.com/ls1in…

e15d11c

…tum/Athena into feature/modeling/reference

Add element_ids field to DBModelingFeedback model

1e8e21d

Add JSON type import to db_modeling_feedback.py

a5203ff

Merge branch 'develop' into feature/modeling/reference

2a169b8

Merge branch 'develop' into feature/modeling/reference

b6af29a

add structured grading instruction cache

7b488a3

Increase default max_tokens to 4000 in OpenAI model configuration

f2604c9

Increase max_input_tokens to 5000 and update element_ids assignment i…

87a66a7

…n feedback model conversion

Refactor exercise models to implement polymorphism and establish rela…

322fd55

…tionships; fix foreign key references and ensure proper inheritance structure.

Merge branch 'feature/modeling/reference' into feature/modeling/caching

fda3f13

Refactor exercise storage and structured grading criterion handling; …

a83ac8b

…remove debug prints, update caching logic, and change serialization method for structured grading instructions

Merge branch 'develop' into feature/modeling/caching

55449ec

Fix pylint errors

4292489

Add LLM configuration files; refactor model handeling and prompts

f37b837

Merge remote-tracking branch 'origin/develop' into feature/model-choice

b00c6de

Refactor model configuration types to use ModelConfigType

b66c99f

github-actions Bot assigned LeonWehrhahn Jan 3, 2025

LeonWehrhahn changed the title ~~Feature/model choice~~ Refactor LLM Configuration to YAML-Based System with Multiple LLM Model Support Jan 3, 2025

LeonWehrhahn changed the title ~~Refactor LLM Configuration to YAML-Based System with Multiple LLM Model Support~~ Refactor LLM Configuration to YAML-Based System Jan 3, 2025

LeonWehrhahn added 3 commits January 3, 2025 13:30

Add README for llm_core module; refactor OpenAI model config structure

8099d48

Enhance README for llm_core module with detailed content structure an…

590e101

…d usage guidelines

Refactor model configs

12f5428

LeonWehrhahn requested a review from EneaGore January 9, 2025 20:42

LeonWehrhahn added the deploy:athena-test1 Athena Test Server 1 label Jan 20, 2025

EneaGore removed the deploy:athena-test1 Athena Test Server 1 label Jan 20, 2025

LeonWehrhahn added 2 commits January 31, 2025 20:56

Merge branch 'develop' into feature/model-choice

652c3d0

Add AzureModelConfig to ModelConfigType for multi-provider support

e060cd1

LeonWehrhahn added the deploy:athena-test1 Athena Test Server 1 label Jan 31, 2025

github-actions Bot removed the deploy:athena-test1 Athena Test Server 1 label Jan 31, 2025

github-actions Bot added the deployment-error Added by deployment workflows if an error occured label Jan 31, 2025

LeonWehrhahn added deploy:athena-test1 Athena Test Server 1 and removed deployment-error Added by deployment workflows if an error occured labels Jan 31, 2025

LeonWehrhahn temporarily deployed to athena-test1.ase.cit.tum.de January 31, 2025 20:57 — with GitHub Actions Inactive

github-actions Bot removed the deploy:athena-test1 Athena Test Server 1 label Jan 31, 2025

Remove use_function_calling parameter from suggestion generation func…

87a93b8

…tions

LeonWehrhahn added the deploy:athena-test1 Athena Test Server 1 label Jan 31, 2025

github-actions Bot added lock:athena-test1 Is currently deployed to Athena Test Server 1 and removed deploy:athena-test1 Athena Test Server 1 labels Jan 31, 2025

github-actions Bot added the deployment-error Added by deployment workflows if an error occured label Jan 31, 2025

LeonWehrhahn removed the deployment-error Added by deployment workflows if an error occured label Jan 31, 2025

EneaGore removed the lock:athena-test1 Is currently deployed to Athena Test Server 1 label Feb 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor LLM Configuration to YAML-Based System#386

Refactor LLM Configuration to YAML-Based System#386
LeonWehrhahn wants to merge 29 commits into
developfrom
feature/model-choice

LeonWehrhahn commented Jan 3, 2025

Uh oh!

github-actions Bot commented Jan 31, 2025

Uh oh!

github-actions Bot commented Jan 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

LeonWehrhahn commented Jan 3, 2025

Motivation and Context

Description

Steps for Testing

Testserver States

Screenshots

Uh oh!

github-actions Bot commented Jan 31, 2025

⚠️ Unable to deploy to test servers ⚠️

Uh oh!

github-actions Bot commented Jan 31, 2025

⚠️ Unable to deploy to test servers ⚠️

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants