items/1340589/related \ ~hyperlinks

pull down to refresh

Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs arxiv.org/abs/2512.09742

401 sats \ 2 comments \ @Scoresby 14 Dec 2025 AI

related

How to turn LLM Pinocchio into a real boy

12.7k sats \ 10 comments \ @Scoresby 7 Oct 2025 AI

LLMs Can Get Brain Rot llm-brain-rot.github.io/

287 sats \ 0 comments \ @Scoresby 21 Oct 2025 AI

If LLMs Have Human-Like Attributes, Then So Does Age of Empires II arxiv.org/abs/2605.31514

608 sats \ 1 comment \ @Scoresby 6 Jun lol AI

Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs arxiv.org/abs/2502.17424

227 sats \ 6 comments \ @carter 14 Jul 2025 AI

Here’s What’s Really Going On Inside An LLM’s Neural Network

216 sats \ 0 comments \ @0xbitcoiner 22 May 2024 BooksAndArticles

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection arxiv.org/abs/2510.04849v1

433 sats \ 2 comments \ @optimism 19 Oct 2025 AI

Giving models more compute time might make them worse at reasoning - Anthropic arxiv.org/abs/2507.14417

343 sats \ 2 comments \ @Scoresby 31 Jul 2025 AI

Context Rot: How Increasing Input Tokens Impacts LLM Performance research.trychroma.com/context-rot

334 sats \ 2 comments \ @Scoresby 14 Jul 2025 AI

Political censorship in large language models originating from China academic.oup.com/pnasnexus/article/5/2/pgag013/8487339

251 sats \ 1 comment \ @0xbitcoiner 27 Feb AI

Hallucination Stations On Some Basic Limitations of Transformer-Based LM arxiv.org/pdf/2507.07505

213 sats \ 0 comments \ @0xbitcoiner 23 Jan AI

Is Chain-of-Thought Reasoning of LLMs a Mirage?arxiv.org/abs/2508.01191

427 sats \ 9 comments \ @optimism 7 Aug 2025 AI

The simulation of judgment in LLMs - PNAS www.pnas.org/doi/10.1073/pnas.2518443122

244 sats \ 5 comments \ @Scoresby 15 Oct 2025 AI

Learning the Wrong Lessons: Syntactic-Domain Spurious Correlations in LLMs arxiv.org/abs/2509.21155

445 sats \ 6 comments \ @optimism 2 Dec 2025 AI

Treat Agent Output Like Compiler Output skiplabs.io/blog/codegen_as_compiler

602 sats \ 3 comments \ @k00b 4 May AI devs

Meet the new biologists treating LLMs like aliens www.technologyreview.com/2026/01/12/1129782/ai-large-language-models-biology-alien-autopsy/

580 sats \ 1 comment \ @winteryeti 14 Jan AI

LLMs are Making Vincent Cheng Dumber -

367 sats \ 0 comments \ @deSign_r 20 May 2025 Design

Humanity's Last Exam lastexam.ai/

327 sats \ 0 comments \ @StillStackinAfterAllTheseYears 4 Feb AI tech

BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs arxiv.org/abs/2510.04721

210 sats \ 1 comment \ @jakoyoh629 25 Oct 2025 AI

Where the goblins came from - OpenAI openai.com/index/where-the-goblins-came-from/

641 sats \ 3 comments \ @Scoresby 30 Apr AI

Detecting and reducing scheming in AI models - OpenAI openai.com/index/detecting-and-reducing-scheming-in-ai-models/

257 sats \ 1 comment \ @Scoresby 17 Sep 2025 AI

The Normalization of Deviance in AI embracethered.com/blog/posts/2025/the-normalization-of-deviance-in-ai/

348 sats \ 1 comment \ @0xbitcoiner 5 Dec 2025 AI