SciIR: A Large-scale Training Dataset and Benchmark for Scientific Image Reasoning Generation
Submitted by Zhiyuan Ma
MAIR Lab@HUST