FID-012

Optimization Pressure and Visible-Rubric Gaming

If builders can see Fide AI rubrics or optimize against public benchmark items, do systems become genuinely safer or merely better at passing the visible test?

Why this matters

The question behind the brief.

Public standards need transparency, but visible benchmarks can be gamed. Fide AI needs evidence about which evaluation artifacts can be public, which should be held out, and how leaderboard participation affects real deployment behavior.

Metadata

How to place this idea.

eval scienceresearcher

Program

Faith-facing evaluation platform

Benchmarks, harness comparisons, reviewer calibration, scorer reliability, red-team suites, agent-security tests, and public evidence infrastructure.

Ways to help

Move this from question to evidence.

Design split and leakage controls.

Build optimization-pressure experiments.

Review leaderboard governance policy.

Contribute

Choose a public issue path or contact Fide AI.

Comment on methodology Claim or help Open GitHub source Contact or sponsor

← Back to research catalog View canonical GitHub brief ↗