How to evaluate my fine-tuned Qwen-VL model on a locally downloaded dataset?

Hi all,

Thanks very much for your great repo. Now I have the following scenario but I can't figure out what to do by only reading the documentations. I have a fine-tuned Qwen-VL model (locally) which can be loaded by `transformers.AutoModelForVision2Seq.from_pretrained`, and a HallusionBench dataset (to which I've made some modifications to the images inside) which can be loaded by `datasets.load_from_disk`. How to evaluate the model and test on my own local modified HallusionBench dataset? Also, is setting CUDA_VISIBLE_DEVICES enough for setting the GPUs to use, and how is batch size determined in the evaluation process?

Appreciations for your help!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to evaluate my fine-tuned Qwen-VL model on a locally downloaded dataset? #1500

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

How to evaluate my fine-tuned Qwen-VL model on a locally downloaded dataset? #1500

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions