Model Evaluation, Leaderboards, Capability Assessment, AI Competition
Display Next Hackfest 2025
zamundaaa.github.io·14h
The Cost of Being Wrong
jack-vanlightly.com·14h
Fixing Engineering’s Biggest Time Suck: Finding Information
thenewstack.io·14h
Dutch CrowS-Pairs: Adapting a Challenge Dataset for Measuring Social Biases in Language Models for Dutch
arxiv.org·1h
Adaptive Relative Pose Estimation Framework with Dual Noise Tuning for Safe Approaching Maneuvers
arxiv.org·1h
Recursive Equations For Imputation Of Missing Not At Random Data With Sparse Pattern Support
arxiv.org·1h