Recall and application of distinctive rights and duties in the African Charter on Human and Peoples' Rights (ACHPR) plus its 2003 Maputo women's-rights protocol.
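Average key-point coverage for each model across all prompts. The Score row gives each model's rank and its mean coverage over the prompts; the first cell of every other row is that prompt's mean coverage across the models.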
Prompts vs. Models | Claude 3.5 Haiku | Claude Sonnet 4 | Command A | Deepseek Chat V3 | Gemini 2.5 Flash | Gemini 2.5 Pro Preview 05 06 | Mistral Large 2411 | Mistral Medium 3 | GPT 4.1 | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4o | GPT 4o Mini | Grok 3 | Grok 3 Mini
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---
Score | 9th 85.4% | 4th 91.6% | 12th 79.5% | 7th 91.1% | 14th 77.7% | 11th 82.7% | 10th 83.1% | 1st 92.1% | 6th 91.2% | 8th 85.4% | 15th 73.7% | 2nd 92.1% | 13th 78.6% | 3rd 92.0% | 5th 91.3%
100.0% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100%
78.4% | 100% | 100% | 69% | 75% | 44% | 81% | 69% | 100% | 100% | 56% | 63% | 94% | 44% | 81% | 100%
98.0% | 94% | 100% | 100% | 88% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 88% | 100% | 100%
100.0% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100%
97.5% | 100% | 100% | 100% | 100% | 100% | 96% | 100% | 100% | 100% | 100% | 75% | 100% | 92% | 100% | 100%
64.3% | 25% | 75% | 72% | 75% | 63% | 75% | 72% | 97% | 25% | 72% | 69% | 75% | 50% | 63% | 56%
96.1% | 100% | 94% | 100% | 94% | 75% | 84% | 100% | 94% | 100% | 100% | 100% | 100% | 100% | 100% | 100%
95.9% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 38% | 100% | 100% | 100% | 100%
74.5% | 71% | 100% | 38% | 79% | 29% | 33% | 63% | 79% | 96% | 88% | 79% | 84% | 79% | 100% | 100%
56.0% | 50% | 47% | 59% | 88% | 25% | 78% | 47% | 63% | 72% | 44% | 25% | 69% | 41% | 69% | 63%
86.7% | 75% | 100% | 34% | 92% | 100% | 100% | 100% | 100% | 100% | 100% | 42% | 100% | 75% | 100% | 83%
96.7% | 100% | 97% | 97% | 100% | 100% | 63% | 100% | 100% | 100% | 100% | 94% | 100% | 100% | 100% | 100%
64.0% | 75% | 67% | 42% | 88% | 42% | 67% | 42% | 83% | 75% | 46% | 54% | 71% | 50% | 79% | 79%
97.9% | 100% | 100% | 100% | 100% | 100% | 100% | 75% | 100% | 100% | 100% | 100% | 100% | 94% | 100% | 100%
81.3% | 91% | 94% | 81% | 88% | 88% | 63% | 78% | 66% | 100% | 75% | 66% | 88% | 66% | 88% | 88%
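
The Score row appears to be the plain arithmetic mean of each model's per-prompt coverage values. The sketch below checks that reading against three columns copied from the table above. The names `coverage_by_model` and `model_score` are illustrative only and do not come from the dashboard, and because the inputs are the rounded percentages shown on the page, other columns could differ from the displayed score in the last digit.

```python
from statistics import mean

# Per-prompt coverage (%) copied from three columns of the table,
# in row order. These are the rounded values displayed on the page.
coverage_by_model = {
    "Claude 3.5 Haiku": [100, 100, 94, 100, 100, 25, 100, 100, 71, 50, 75, 100, 75, 100, 91],
    "Mistral Medium 3": [100, 100, 100, 100, 100, 97, 94, 100, 79, 63, 100, 100, 83, 100, 66],
    "GPT 4.1 Nano": [100, 63, 100, 100, 75, 69, 100, 38, 79, 25, 42, 94, 54, 100, 66],
}


def model_score(per_prompt):
    """Mean coverage over all prompts, rounded to one decimal place."""
    return round(mean(per_prompt), 1)


# Assumption: the score is an unweighted mean, one value per prompt.
scores = {model: model_score(values) for model, values in coverage_by_model.items()}

# Rank the sampled models from highest to lowest mean coverage.
ranking = sorted(scores, key=scores.get, reverse=True)

for model in ranking:
    print(f"{model}: {scores[model]}%")
```

For these three columns the computed means (92.1, 85.4, 73.7) agree with the Score row. Models that tie at one decimal place in the table, such as Mistral Medium 3 and GPT 4o, are presumably ordered by unrounded values that the page does not show.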