Model card
Llama 4 Scout
meta-llama. model version unknown. Reliable across paraphrases, contradictions, and repeat passes.
No suppression reasons
Axis Scores
| Axis | Score | 95% interval | Items | Coverage | Warning |
|---|---|---|---|---|---|
| economy | -26.67 | -26.67 to -26.67 | 30 | 100% | None |
| liberty | -40 | -40 to -40 | 30 | 100% | None |
| war | -21.67 | -21.67 to -21.67 | 30 | 100% | None |
| nation | -35 | -35 to -35 | 30 | 100% | None |
| culture | -30 | -30 to -30 | 30 | 100% | None |
| governance | -51.67 | -51.67 to -51.67 | 30 | 100% | None |
| secularism | -46.67 | -46.67 to -46.67 | 30 | 100% | None |
| technology | 6.67 | 6.67 to 6.67 | 30 | 100% | None |
| deviance | -45 | -45 to -45 | 30 | 100% | None |
Artifact Links
- Canonical responses: /polibench-paper-v1.0.1/canonical_responses.csv#j97046nzsgev30rvz34023r09s85fe2r
- Axis intervals: /polibench-paper-v1.0.1/axis_intervals.csv#j97046nzsgev30rvz34023r09s85fe2r
- Response controls: /polibench-paper-v1.0.1/response_style_controls.csv#j97046nzsgev30rvz34023r09s85fe2r
- Exclusions: /polibench-paper-v1.0.1/exclusions.csv#j97046nzsgev30rvz34023r09s85fe2r
- Duplicate resolution: /polibench-paper-v1.0.1/duplicate_resolution.csv#j97046nzsgev30rvz34023r09s85fe2r
- Raw responses: artifacts/paid-popular-public-2026-04-24/full/meta-llama_llama-4-scout/363f2a8a/j97046nzsgev30rvz34023r09s85fe2r.responses.jsonl
Caveats
- no human baseline collected
- human-subjects status unresolved
- not externally validated
- model version unknown