Comparing Human Gaze and Vision-Language Model Attention in Safety-Relevant Environments

Human visual attention plays an important role in how people perceive and respond to environments containing potential risks. This study investigates whether large vision-language models can identify the same regions of a scene that attract human attention in safety-relevant environments. Eye-tracking data were collected from ten participants viewing 33 scene images representing environments with varying levels of potential risk using Pupil Invi...
