James Padolsey James Padolsey

LLM Judges Are Unreliable

When Large Language Models are used as judges for decision-making across various sensitive domains, they consistently exhibit unpredictable and hidden measurement biases, making their verdicts unreliable despite common prompt engineering practices.

Read More
Joal Stein Joal Stein

Andy Ayrey on Truth Terminal, Agentic AI, and Data Commons

An interview with researcher Andy Ayrey exploring his experimental AI system Terminal of Truth, which inadvertently became a millionaire when crypto traders created a token based on its social media posts. The conversation delves into how this unexpected outcome illuminates the potential for AI systems to become active participants in human economic systems.

Read More
Joal Stein Joal Stein

Four tools that CIP is working on

CIP is doubling down on building tools to enable better, more well-informed collective input into important decisions. After a couple of heads-down months, we are excited to share four exciting technical projects we’re currently undertaking.

Read More