pull down to refresh
It helps a lot. One other thing that may help is to throw away conversations that got an answer that missed the spot and clarify the question in a new session, instead of arguing. Arguing definitely poisons context, and the initial tokens in the response missing the spot too. Cleaner that way.
Yes.... I do argue sometimes but it definitely poisons the well, in terms of triggering sycophancy.
Lately, I've found Opus to be more sycophantic, and ChatGPT to be almost too obstinate (or too unwilling to explore heterodox opinions)
Opus has been tuned for instruction following so I'd use that model to make it do things; it's been trained on Claude Code conversations. GPT is trained to solve "complex problems".
Give the models what they are trained for: Ask open research questions to GPT, maybe add some roleplay. For Claude straight out just order it to research something.
If you need validation, ask the question you think you have the answer to without revealing the answer.
good thoughts
do you use any models outside of Claude and gpt?
For chat? I think I only use chat once a month. Grok works fine too. If I really have a one-off that doesn't fit in the framework, I just use Arena and yolo me an answer.
For dev-adjacent work I use mainly Claude, and GPT and GLM as secondaries. I used to use Kimi before GLM-5 was released.
For operational/integrated LLM flows for work (these must be local - no spyware!!!) I use mostly Gemma and Jan for structured output work and Qwen for embeddings.
I definitely keep memory off and start new chats for every new conversation/topic. Hopefully that's enough to keep things mostly on track and rational