6 months later: Apple Finally Shipped Local Multimodal in Xcode 27 Beta (opens in new tab)
A while ago I wrote a full llama.cpp iOS implementation using Obj-c bridge because I wanted one thing: image in -> structured JSON out -> no cloud required. It worked. It was fast enough. It was also a lot of plumbing: XCFramework builds ObjC++ bridge tokenizer/eval/sampling internals model + projector file choreography JSON guardrails everywhere Now, about 6 months later, Apple dropped Foundation Models image analysis in Xcode 27.0 beta, and i can finally call a serious on-device model witho...
Read the original article