model.dimension = 256 # Only if the model supports MRL

| Model | MMLU | HumanEval (Code) | GSM8K (Math) | Inference Speed (t/s on A100) |
| :--- | :--- | :--- | :--- | :--- |
| | 58.2 | 42.6 | 61.4 | 210 |
| Phi-3-mini (3.8B) | 62.0 | 45.0 | 65.0 | 195 |
| Gemma-2 2B | 52.5 | 30.1 | 48.3 | 280 |
| Qwen2.5-3B | 56.0 | 38.2 | 55.0 | 205 |

Calculates downward and upward (uplift) capacity based on static analysis methods.