@anon
sign up
@anon
sign up
pull down to refresh
Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs
arxiv.org/abs/2512.09742
401 sats
\
2 comments
\
@Scoresby
14 Dec 2025
AI
related
How to turn LLM Pinocchio into a real boy
12.7k sats
\
10 comments
\
@Scoresby
7 Oct 2025
AI
LLMs Can Get Brain Rot
llm-brain-rot.github.io/
287 sats
\
0 comments
\
@Scoresby
21 Oct 2025
AI
If LLMs Have Human-Like Attributes, Then So Does Age of Empires II
arxiv.org/abs/2605.31514
608 sats
\
1 comment
\
@Scoresby
6 Jun
lol
AI
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs
arxiv.org/abs/2502.17424
227 sats
\
6 comments
\
@carter
14 Jul 2025
AI
Here’s What’s Really Going On Inside An LLM’s Neural Network
216 sats
\
0 comments
\
@0xbitcoiner
22 May 2024
BooksAndArticles
When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection
arxiv.org/abs/2510.04849v1
433 sats
\
2 comments
\
@optimism
19 Oct 2025
AI
Giving models more compute time might make them worse at reasoning - Anthropic
arxiv.org/abs/2507.14417
343 sats
\
2 comments
\
@Scoresby
31 Jul 2025
AI
Context Rot: How Increasing Input Tokens Impacts LLM Performance
research.trychroma.com/context-rot
334 sats
\
2 comments
\
@Scoresby
14 Jul 2025
AI
Political censorship in large language models originating from China
academic.oup.com/pnasnexus/article/5/2/pgag013/8487339
251 sats
\
1 comment
\
@0xbitcoiner
27 Feb
AI
Hallucination Stations On Some Basic Limitations of Transformer-Based LM
arxiv.org/pdf/2507.07505
213 sats
\
0 comments
\
@0xbitcoiner
23 Jan
AI
Is Chain-of-Thought Reasoning of LLMs a Mirage?
arxiv.org/abs/2508.01191
427 sats
\
9 comments
\
@optimism
7 Aug 2025
AI
The simulation of judgment in LLMs - PNAS
www.pnas.org/doi/10.1073/pnas.2518443122
244 sats
\
5 comments
\
@Scoresby
15 Oct 2025
AI
Learning the Wrong Lessons: Syntactic-Domain Spurious Correlations in LLMs
arxiv.org/abs/2509.21155
445 sats
\
6 comments
\
@optimism
2 Dec 2025
AI
Treat Agent Output Like Compiler Output
skiplabs.io/blog/codegen_as_compiler
602 sats
\
3 comments
\
@k00b
4 May
AI
devs
Meet the new biologists treating LLMs like aliens
www.technologyreview.com/2026/01/12/1129782/ai-large-language-models-biology-alien-autopsy/
580 sats
\
1 comment
\
@winteryeti
14 Jan
AI
LLMs are Making Vincent Cheng Dumber -
367 sats
\
0 comments
\
@deSign_r
20 May 2025
Design
Humanity's Last Exam
lastexam.ai/
327 sats
\
0 comments
\
@StillStackinAfterAllTheseYears
4 Feb
AI
tech
BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
arxiv.org/abs/2510.04721
210 sats
\
1 comment
\
@jakoyoh629
25 Oct 2025
AI
Where the goblins came from - OpenAI
openai.com/index/where-the-goblins-came-from/
641 sats
\
3 comments
\
@Scoresby
30 Apr
AI
Detecting and reducing scheming in AI models - OpenAI
openai.com/index/detecting-and-reducing-scheming-in-ai-models/
257 sats
\
1 comment
\
@Scoresby
17 Sep 2025
AI
The Normalization of Deviance in AI
embracethered.com/blog/posts/2025/the-normalization-of-deviance-in-ai/
348 sats
\
1 comment
\
@0xbitcoiner
5 Dec 2025
AI
more