No body text on file.
Open the original to read the full piece.
An ex-OpenAI safety researcher told @xmayeth at an Austin afterparty that OpenAI's 18 months of internal eval work (calibration, verification, structured reasoning) was effectively replicated by Polymarket's open-source RAG repo. He runs an autonomous agent scoring markets every 20 minutes across 9 sources, using >12% edge entries and quarter-Kelly sizing — yielding +$61,000 in five months; the author tried it with $750 and saw +$340 in week one.
An ex-OpenAI safety researcher said OpenAI spent 18 months building internal eval pipelines (confidence calibration, multi-source verification, structured reasoning) and claimed 'GPT is optimized to be agreeable' while 'Claude is optimized to be uncertain' — uncertainty is the edge in prediction markets.
Open the original to read the full piece.