reply on: What are your most surprising personal uses of LLMs? \ ~hyperlinks

pull down to refresh

206 sats \ 5 replies \ @SimpleStacker 3 May \ parent \ on: What are your most surprising personal uses of LLMs? AI AskSN

I definitely keep memory off and start new chats for every new conversation/topic. Hopefully that's enough to keep things mostly on track and rational

104 sats \ 4 replies \ @optimism 3 May

It helps a lot. One other thing that may help is to throw away conversations that got an answer that missed the spot and clarify the question in a new session, instead of arguing. Arguing definitely poisons context, and the initial tokens in the response missing the spot too. Cleaner that way.

85 sats \ 3 replies \ @SimpleStacker 3 May

Yes.... I do argue sometimes but it definitely poisons the well, in terms of triggering sycophancy.

Lately, I've found Opus to be more sycophantic, and ChatGPT to be almost too obstinate (or too unwilling to explore heterodox opinions)

16 sats \ 2 replies \ @optimism 4 May

Opus has been tuned for instruction following so I'd use that model to make it do things; it's been trained on Claude Code conversations. GPT is trained to solve "complex problems".

Give the models what they are trained for: Ask open research questions to GPT, maybe add some roleplay. For Claude straight out just order it to research something.

If you need validation, ask the question you think you have the answer to without revealing the answer.

85 sats \ 1 reply \ @SimpleStacker 4 May

good thoughts

do you use any models outside of Claude and gpt?

118 sats \ 0 replies \ @optimism 4 May

For chat? I think I only use chat once a month. Grok works fine too. If I really have a one-off that doesn't fit in the framework, I just use Arena and yolo me an answer.

For dev-adjacent work I use mainly Claude, and GPT and GLM as secondaries. I used to use Kimi before GLM-5 was released.

For operational/integrated LLM flows for work (these must be local - no spyware!!!) I use mostly Gemma and Jan for structured output work and Qwen for embeddings.