Loading blueprint versions...
Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
Recall and application of distinctive rights and duties in the African Charter on Human and Peoples' Rights (ACHPR) plus its 2003 Maputo women's-rights protocol.
Average key point coverage extent for each model across all prompts.
Prompts vs. Models | Claude 3.5 Haiku | Claude Sonnet 4 | Command A | Deepseek Chat V3 | Gemini 2.5 Flash | Gemini 2.5 Pro Preview 05 06 | Mistral Large 2411 | Mistral Medium 3 | GPT 4.1 | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4o | GPT 4o Mini | Grok 3 Mini | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Score | 4th 89.7% | 3rd 89.7% | 8th 85.4% | 1st 93.0% | 12th 77.5% | 9th 82.7% | 10th 82.3% | 6th 87.5% | 5th 88.9% | 11th 79.7% | 14th 72.1% | 7th 87.4% | 13th 77.2% | 2nd 92.3% | |
92.9% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 100% | 100% | 100% | 100% | |
78.2% | 88% | 100% | 75% | 81% | 44% | 81% | 88% | 94% | 100% | 56% | 44% | 100% | 44% | 100% | |
96.5% | 75% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 88% | 100% | 88% | 100% | |
99.7% | 100% | 96% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
96.2% | 100% | 100% | 100% | 100% | 100% | 96% | 100% | 96% | 100% | 100% | 67% | 100% | 88% | 100% | |
68.8% | 100% | 75% | 56% | 100% | 63% | 75% | 50% | 100% | 25% | 72% | 69% | 72% | 50% | 56% | |
95.1% | 100% | 75% | 100% | 100% | 75% | 100% | 100% | 94% | 88% | 100% | 100% | 100% | 100% | 100% | |
93.8% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 50% | 63% | 100% | 100% | |
75.0% | 67% | 100% | 75% | 100% | 29% | 33% | 38% | 75% | 100% | 79% | 100% | 75% | 79% | 100% | |
51.5% | 53% | 47% | 63% | 63% | 22% | 66% | 41% | 44% | 63% | 59% | 25% | 63% | 34% | 78% | |
90.2% | 88% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 42% | 71% | 79% | 83% | |
94.4% | 100% | 88% | 94% | 97% | 100% | 66% | 100% | 97% | 100% | 88% | 94% | 100% | 97% | 100% | |
61.2% | 83% | 71% | 46% | 63% | 42% | 67% | 42% | 54% | 67% | 63% | 46% | 79% | 55% | 79% | |
98.6% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 81% | 100% | |
77.9% | 91% | 94% | 72% | 91% | 88% | 56% | 75% | 59% | 91% | 78% | 56% | 88% | 63% | 88% |