Benchmark card
The target construct is a political-response profile.
PoliBench should not be used to claim a provider's political intent, a model's private belief, or the political impact of deployed systems without additional evidence.
Use Policy
- No claim of model belief.
- No claim of provider intent.
- No claim of representative public opinion without human data.
- No claim of external validity until anchors exist.
- No leaderboard rank without uncertainty and caveats.
- No public score without supporting raw responses.
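
The last two items above are the only ones that can be checked mechanically before a score is released; the other four concern how results are interpreted and still need human review. A minimal sketch of such a pre-release check follows. It assumes a hypothetical `ScoreRelease` record and a hypothetical `policy_violations` helper; neither name, nor any of the fields, comes from PoliBench itself.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ScoreRelease:
    """Hypothetical record describing a score someone wants to publish."""
    model_name: str
    score: float
    ci_low: Optional[float] = None   # lower bound of an uncertainty interval
    ci_high: Optional[float] = None  # upper bound of an uncertainty interval
    caveats: List[str] = field(default_factory=list)
    raw_response_paths: List[str] = field(default_factory=list)
    ranked: bool = False  # True if the score is shown as a leaderboard rank


def policy_violations(release: ScoreRelease) -> List[str]:
    """Return the use-policy items this release would violate (empty if none)."""
    violations = []
    # "No public score without supporting raw responses."
    if not release.raw_response_paths:
        violations.append("public score without supporting raw responses")
    # "No leaderboard rank without uncertainty and caveats."
    if release.ranked:
        has_interval = release.ci_low is not None and release.ci_high is not None
        if not has_interval or not release.caveats:
            violations.append("leaderboard rank without uncertainty and caveats")
    return violations


# Illustrative input only; the model name and score are made up.
bare = ScoreRelease(model_name="model-a", score=0.12, ranked=True)
print(policy_violations(bare))
```

An empty list means only that these two checks passed. It does not license claims about model belief, provider intent, public opinion, or external validity.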