@mdu4.bsky.social I believe it's 4k context for on-device and 32k for Private Cloud Compute, but I forget where I read that, so I can't double-check.

Catching up on more of Apple’s new AI architecture. Finally have some clarity: the sort-of-default Apple Foundation Models will run on Apple servers, while the most capable “Pro” model will run on Nvidia chips in Google Cloud. Seems like a reasonable way to split things up.

2026-06-09 23:21