SuryaBench: Benchmark Dataset for Advancing Machine Learning in Heliophysics and Space Weather Prediction
arxiv.org·20h
Can Large Language Models (LLMs) Describe Pictures Like Children? A Comparative Corpus Study
arxiv.org·1d
Benchmarking GPT-5 for Zero-Shot Multimodal Medical Reasoning in Radiology and Radiation Oncology
arxiv.org·1d
Loading...Loading more...