pull down to refresh

What was the token cost?

I haven't verified the bugs and their severity yet, but it feels like a success

I feel like that's the part where you can't really determine if it was a success until after you verify and review.

I want to be able to offload more work to AI, but I find that I'm still too distrustful of it. I'm like the micromanaging supervisor who doesn't trust his reports and ends up doing everything himself.

What was the token cost?

I used my Max plan where we don't pay per token (largely believed to be heavily subsidized and mysteriously rate/intelligence limited). It did eat into ~20% of my weekly token budget.

I want to be able to offload more work to AI, but I find that I'm still too distrustful of it. I'm like the micromanaging supervisor who doesn't trust his reports and ends up doing everything himself.

It's great at review. It's great at proving out concepts and rounding them out. It tends to write bad code and make subtle and complicated mistakes.

In a particularly tricky part of this PR, I rewrote it close to 20 times (with more targeted prompting) because what the LLMs wrote was incomprehensible and very hard to weed out why until I understood what the incomprehensible thing was doing.

reply