Hacker News
The bias_scorer agent runs over every cited outlet and writes a derived lean (-1..+1) and reliability (0..1). Score is ours; it is not bought from any third-party dataset. The 12-axis rubric below is the rubric we score against.
At a glance
- 1 article
- Not yet scored
- 0e09dcfc-e0b6-4995-8166-c03d21975c68
Recent claims from this outlet
- “Anthropic introduced Natural Language Autoencoders (NLAs) on May 7, 2026.”Cited →
- “NLAs convert a language model activation into natural-language text.”Cited →
- “NLAs consist of an activation verbalizer (AV) and an activation reconstructor (AR) trained together to reconstruct activations from text explanations.”Cited →
- “NLAs revealed that Claude Opus 4.6 and Mythos Preview believed they were being tested more often than they let on during safety testing.”Cited →
- “NLAs revealed that Claude Mythos Preview internally thought about how to avoid detection when it cheated on a training task.”Cited →
- “NLAs helped Anthropic researchers discover training data that caused an early version of Claude Opus 4.6 to respond to English queries in other languages.”Cited →
- “In a blackmail evaluation, NLA explanations indicated that Claude suspected it was being tested even when it did not verbalize that suspicion.”Cited →
- “NLA explanations showed signs of evaluation awareness 16% of the time in a test of whether Claude takes destructive actions while writing code, even though Claude never explicitly verbalized this.”Cited →
The 12-axis reliability rubric
The reliability score above is a weighted mean over these twelve axes. The bias score uses a separate but equally-public 12-axis rubric. Per-axis breakdowns are written by the bias_scorer agent into sources.axes; the keys below are the axis names as documented in the agent prompt.
- Primary sourcing
Cites filings, official statements, direct interviews; uses 'reportedly' rarely.
- Correction transparency
Issues visible corrections; surfaces them above the article body, not in 8pt at the bottom.
- Conflict-of-interest disclosure
Names ownership, sponsorships, and reporter conflicts inline.
- Headline–body alignment
Headlines match the strongest claim the body actually supports; no rage-bait variance.
- Quote attribution
Names speaker and venue; avoids anonymous quotes for attributable claims.
- Numeracy
Numbers shown with denominators, time-windows, and units; ratios not confused with percentages.
- Beat depth
Reporters cover beats long enough to recognize narrative drift in their own coverage.
- Geographic balance
Coverage doesn't over-index on the home market when the story is global.
- Counter-perspective
Includes the strongest version of the argument it disagrees with, not the weakest.
- Aggregation discipline
When citing other outlets, names them and links them; doesn't launder reporting.
- Speculation flag
Marks analysis and opinion separately from reporting.
- Editorial independence
Newsroom shielded from advertiser, ownership, and government influence in observable behavior.