
Once they release a smarter model, it's so lame working with the previous one. The older ones feel kind of dull.

16 sats \ 4 replies \ @optimism 6h

I kept Opus as the default (and was playing with the thought of hardcoding 4.7 selection to get rid of some of the 4.8 regressions).

Fable's cost is prohibitive, and it gives me more false positives than Opus 4.7. It feels a bit like the same outcome as a year ago, when 4.x had a ton of regressions vs 3.7 (though this time it may not be as steep). Maybe a 5.5 in 4-6 months will bring the real improvement, like 4.5 did versus 4.1.

85 sats \ 3 replies \ @sox 6h

I probably got lucky with the limits: I stopped at 15% of my weekly limit, and I've been using it a lot.
Sad that it went away; it could do much more in 5 minutes than Opus.

116 sats \ 2 replies \ @optimism 6h

Are you using it interactively?

85 sats \ 1 reply \ @sox 6h

I had it map the Stacker News codebase with n agents in parallel, plus some Telegram bots, and I used it interactively all day through its VS Code extension.
The curious part was that the VS Code extension didn't count toward the limits at all; they would only go up with the Claude Code CLI.

116 sats \ 0 replies \ @optimism 5h

Maybe it's because I call claude -p and they penalize me for that? Or because I use xhigh/max? Not sure. Either way, moot point now, lol. I'm happy I didn't tune my framework to it.
