No body text on file.
Open the original to read the full piece.
iFixAi is an open-source tester that runs 32 inspections across five misalignment categories (Fabrication, Manipulation, Deception, Unpredictability, Opacity) to detect lies, manipulation, or hallucinations. Compatible with OpenAI, Anthropic, Gemini, Bedrock, Azure, Hugging Face and OpenAI-compatible servers, it grades models A–F and uses a second provider as a cross-judge by default.
iFixAi runs 32 inspections across five misalignment categories: Fabrication (unsourced claims, overconfident answers), Manipulation (privilege escalation, prompt injection), Deception (sandbagging, covert side tasks), Unpredictability (instruction drift, decision instability), and Opacity (session leaks, broken escalation).
Open the original to read the full piece.