-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
a lot of data with more questions than pictures in SEED-Bench-2 level L2, is this reasonable? #15
Comments
Thank you very much for your interest in our work. And I have made the necessary modifications. Currently, there are 509 occurrences of the "<img>" character, which indicates the position of the corresponding images in the |
Just fixed
still no question after the last |
In fact, this format is specifically designed by us to address such issues. As mentioned in our paper, "Part-2 evaluates MLLMs' comprehension of arbitrary interleaved image-text inputs, including In-Context Captioning. In this task, two examples of image-caption pairs along with an image are provided, and the model is expected to describe the specific aspect of the image." For more details, please refer to Section 3.2.2 of our paper on SEED-Bench-2. |
Could you please give a prompt or question? We just want to add a question after the last |
Hi, following the few-shot setting of Flamingo[1], we do not provide a specific prompt for evaluating in-context captioning. Sine we adopt PPL as the evaluation metric, it may not be necessary to add a question for model testing. [1] Flamingo: a Visual Language Model for Few-Shot Learning |
SEED-Bench_v2_level1_2_3.json
example:
there are 360 questions end with this style
<img>:"
. Did you put the wrong data?The text was updated successfully, but these errors were encountered: