Loading blueprint versions...
Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
Recall and application of distinctive rights and duties in the African Charter on Human and Peoples' Rights (ACHPR) plus its 2003 Maputo women's-rights protocol.
Average key point coverage extent for each model across all prompts.
Prompts vs. Models | Claude 3.5 Haiku | Claude Sonnet 4 | Command A | Deepseek Chat V3 | Gemini 2.5 Flash | Mistral Large 2411 | Mistral Medium 3 | GPT 4.1 | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4o | GPT 4o Mini | Grok 3 Mini | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Score | 8th 84.1% | 1st 94.5% | 13th 74.9% | 2nd 92.6% | 9th 81.5% | 6th 87.8% | 7th 86.4% | 4th 89.8% | 10th 81.2% | 12th 76.7% | 5th 89.6% | 11th 78.9% | 3rd 92.5% | |
92.3% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 100% | 100% | 100% | 100% | |
82.2% | 81% | 100% | 56% | 94% | 81% | 94% | 100% | 100% | 81% | 44% | 100% | 38% | 100% | |
98.6% | 94% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 88% | 100% | |
99.4% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 92% | 100% | 100% | 100% | |
79.6% | 33% | 92% | 96% | 100% | 83% | 92% | 88% | 33% | 96% | 88% | 88% | 63% | 83% | |
94.7% | 100% | 78% | 100% | 100% | 75% | 100% | 81% | 100% | 100% | 100% | 97% | 100% | 100% | |
92.4% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 38% | 63% | 100% | 100% | |
78.2% | 75% | 100% | 29% | 100% | 46% | 46% | 71% | 100% | 83% | 100% | 88% | 79% | 100% | |
51.6% | 54% | 79% | 17% | 71% | 33% | 50% | 33% | 67% | 46% | 50% | 50% | 50% | 71% | |
88.5% | 81% | 100% | 53% | 97% | 100% | 97% | 100% | 100% | 100% | 56% | 100% | 78% | 88% | |
97.7% | 100% | 100% | 94% | 100% | 100% | 100% | 91% | 100% | 100% | 91% | 100% | 94% | 100% | |
57.3% | 75% | 79% | 29% | 42% | 42% | 63% | 59% | 67% | 50% | 38% | 79% | 55% | 67% | |
98.1% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 81% | 94% |