Teaching a Model to Reason Before It Learns to Talk (opens in new tab)
A weekend project that turned into a bet against the whole transformer playbook. The short version Almost every AI you’ve heard of is a transformer trained on a firehose of text. It learns language first, and reasoning sort of comes along for the ride. I’m trying the opposite: a tiny model that learns logic and […]
Read the original article