manton

Trying to get my AI spending under control, I’m having the server randomly switch to the “flex” pricing tier in GPT-5 when there’s background work that doesn’t need to be instantly fast. So far so good.
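
A minimal sketch of that idea, not the actual Micro.blog server code. It assumes the request accepts a `service_tier` field with values such as `"flex"` and `"default"`; the function names, the `is_background` flag, and the `flex_probability` knob are hypothetical and exist only for illustration.

```python
import random


def choose_service_tier(is_background, flex_probability=0.5, rng=random):
    """Pick a pricing tier for one request.

    Only background work is ever eligible for the cheaper, slower tier,
    and even then only a random fraction of it (flex_probability).
    """
    if is_background and rng.random() < flex_probability:
        return "flex"
    return "default"


def build_request(prompt, is_background, flex_probability=0.5, rng=random):
    """Assemble a request payload (assumed shape) with the chosen tier."""
    return {
        "model": "gpt-5",
        "input": prompt,
        "service_tier": choose_service_tier(is_background, flex_probability, rng),
    }
```

Interactive requests always stay on the default tier here, so only work that can tolerate extra latency is ever moved to the cheaper one.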

lukebouch

@manton I bet AI spend could get out of control pretty fast. What all features are you using it for in Micro.blog?

manton

@lukebouch Still fairly limited in Micro.blog. Mostly things like accessibility text in photos and transcripts for podcasts. The big new thing is Inkwell’s “Reading Recap” feature, which creates a personal recap of last week’s blog posts in your subscriptions. That bumped up usage by over 10x.

cdevroe@mastodon.social

@manton I've built a local "app" (nearly an agent) that determines which models to run based on prompt heuristics. I have several local models, an image model, and several cloud-based models tuned so that it knows which to call when.

It works pretty well. I’m absolutely positive something better will be created by someone else (if it doesn't exist already) but I'm still thinking of open sourcing it.
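
Routing on prompt heuristics could look roughly like the sketch below. This is an illustration only, not the app described above: the model names, keyword sets, and length threshold are all hypothetical.

```python
import re

# Hypothetical keyword sets and threshold, chosen only for illustration.
IMAGE_WORDS = {"draw", "image", "picture", "illustrate"}
HARD_WORDS = {"prove", "refactor", "analyze"}
LONG_PROMPT_CHARS = 2000


def route_model(prompt):
    """Return the name of the (hypothetical) model that should handle a prompt."""
    words = set(re.findall(r"[a-z]+", prompt.lower()))
    if words & IMAGE_WORDS:
        return "image-model"
    # Long or demanding prompts go to a larger cloud-hosted model.
    if len(prompt) > LONG_PROMPT_CHARS or words & HARD_WORDS:
        return "cloud-large"
    # Everything else stays on a cheap local model.
    return "local-small"
```

Matching on whole words rather than substrings keeps a prompt such as "withdraw cash" from being routed to the image model.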
