manton

When I get a questionable notification summary in the iOS 18.1 beta, I run the original text through OpenAI to compare. It’s usually better. Certainly not a deal-breaker, but the thing about AI is it needs to actually be good or you lose the illusion.

apike@mastodon.social

@manton It’s telling that Apple’s model does worse, even with their expert-tuned prompt, than OpenAI’s generic chat input does. World knowledge is surprisingly helpful for summarization.

vikanezrimaya.xyz

@manton I have an LLM enhancement feature in my in-progress Micropub client called Smart Summary, and it's basically bring-your-own-model (uses Ollama that you have to host somewhere). The prompt is also customizable.

Maybe Apple needs to consider letting users bring their own models (and prompts?), too.

manton

@apike I worry about this a little for Apple. The seamless balance of on-device models and the cloud is a nice strategy, but the small models won't get better for years because of hardware limits.

samradford

@manton of course, OpenAI can’t provide timely summaries of notifications on my iPhone at all. I mean, I get what you’re saying, but feels kinda moot. And I’d say the hit ratio for me with Apple Summaries has been more than high enough to heavily adopt into my daily usage.

manton

@samradford To be clear, I like the Apple summaries too. I think a cloud-based solution like OpenAI would be fast enough, though. A bigger problem is making sure it's private enough since you're sharing other people's messages.

stevex@mastodon.social

@manton @apike Small models have been getting better as training and architecture improve... there’s a limit, but I don't think we're quite stuck where we are.

apike@mastodon.social

@stevex @manton Yeah for sure – although Apple is currently (AFAIK) trying to use one small model for a wide set of tasks, there seems to be a lot of headroom left, especially for distilling large models into small task-specific models.
