Model Evaluation, Leaderboards, Capability Assessment, AI Competition
The Dark Side Of Software: Anti-Patterns (and How To Fix Them)
paulsblog.dev · 18h
OpenAI signs deal with UK to find government uses for its models
theguardian.com · 5h
GRWM for NYCW #255
ctvc.co · 12h
Estimating Cognitive Effort from Functional Near-Infrared Spectroscopy (fNIRS) Signals using Machine Learning
arxiv.org · 21h
"PhyWorldBench": A Comprehensive Evaluation of Physical Realism in Text-to-Video Models
arxiv.org · 21h
Consistent Explainers or Unreliable Narrators? Understanding LLM-generated Group Recommendations
arxiv.org · 21h
BREAKING: Norway's $2 trillion wealth fund ran a 12-month AI experiment.
threadreaderapp.com · 12h