Request to add SIBench evaluation code by song2yu · Pull Request #1310 · open-compass/VLMEvalKit

song2yu · 2025-11-09T15:20:43Z

Added evaluation code for the SIBench paper: "How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective https://arxiv.org/abs/2509.18905 ".

Includes inference_mixed.py to support mixed inference for both images and videos.
Includes SIBench.py for processing the SIBenchmark.
Introduced a new MixedOutput format.
Added post-processing support for the MixedOutput format in run.py.

tonysy · 2025-11-12T09:40:10Z

Please fix the lint issue

song2yu · 2025-11-18T07:43:34Z

Thanks for the reminder. I have already checked the code according to the development guide pre-commit run --all-files, and now shows no formatting errors.

mzr1996 · 2025-12-22T07:40:15Z

Hello, I have checked the PR. Do we need to add an extra inference script? If it has video, I think directly using inference_video is enough.

song2yu · 2025-12-22T07:51:54Z

Thank you for your response. Since SIBench contains mixed data types—including single images, multiple images, and videos—a separate Inference_mixed.py is required to handle them respectively. Therefore, we recommend keeping it.

song2yu added 6 commits November 9, 2025 23:02

Update run.py

9e7202f

Update __init__.py

a126610

Add SIBench to dataset classes

84e18a8

Add files via upload

c598c82

Add files via upload

32dcfb1

Update run.py

4963cc3

song2yu added 6 commits November 12, 2025 21:43

Refactor SIBench.py for improved readability

a028227

Update inference_mixed.py

c9e7b89

Update SIBench.py

7929563

Refactor parameter formatting in inference_mixed.py

9ddfc26

Update SIBench.py

96bc750

Update inference_mixed.py

e6947bd

Merge branch 'main' into main

817626a

Merge branch 'main' into main

160057c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Request to add SIBench evaluation code#1310

Request to add SIBench evaluation code#1310
song2yu wants to merge 14 commits intoopen-compass:mainfrom
song2yu:main

song2yu commented Nov 9, 2025

Uh oh!

tonysy commented Nov 12, 2025

Uh oh!

song2yu commented Nov 18, 2025

Uh oh!

mzr1996 commented Dec 22, 2025

Uh oh!

song2yu commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

song2yu commented Nov 9, 2025

Uh oh!

tonysy commented Nov 12, 2025

Uh oh!

song2yu commented Nov 18, 2025

Uh oh!

mzr1996 commented Dec 22, 2025

Uh oh!

song2yu commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants