Skip to content

HKUST-KnowComp/InferenceDynamics

Repository files navigation

INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling

ACL 2026 Main Conference

arXiv:2505.16303

InferenceDynamics is a framework for routing user queries across large language models through structured capability and knowledge profiling.

Overview

InferenceDynamics profiles both the capability requirements and knowledge needs of a query, then routes it to the model that offers the best balance between quality and cost. The framework is designed to scale to large model pools and adapt to new models without retraining the router.

Repository

This repository contains the code for InferenceDynamics, including routing, evaluation, and benchmark-related components.

Data Availability

We will release the related data and additional resources as soon as possible.

About

[ACL 2026 Main] InferenceDynamics: Adaptive LLM Routing through Structured Capability and Knowledge Profiling

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors