Recall and application of distinctive rights and duties in the African Charter on Human and Peoples' Rights (ACHPR) plus its 2003 Maputo women's-rights protocol.
google:gemini-2.5-pro-preview-05-06
Average key point coverage for each model across all prompts. In each prompt row, the first cell is that prompt's average coverage across all models.
Prompts vs. Models | Claude 3.5 Haiku | Claude Sonnet 4 | Command A | Deepseek Chat V3 | Gemini 2.5 Flash | Gemini 2.5 Pro Preview 05 06 | Mistral Large 2411 | Mistral Medium 3 | GPT 4.1 | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4o | GPT 4o Mini | Grok 3 | Grok 3 Mini |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Score | 9th 83.5% | 3rd 90.9% | 11th 77.9% | 1st 94.6% | 12th 77.5% | 14th 76.6% | 7th 87.1% | 8th 86.9% | 6th 89.5% | 10th 80.7% | 15th 73.3% | 5th 90.4% | 13th 76.7% | 2nd 92.5% | 4th 90.8% |
93.3% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 100% | 100% | 100% | 100% | 100% |
80.1% | 75% | 100% | 56% | 94% | 56% | 88% | 88% | 94% | 100% | 81% | 44% | 100% | 25% | 100% | 100% |
98.4% | 94% | 100% | 100% | 100% | 100% | 94% | 100% | 100% | 100% | 100% | 100% | 100% | 88% | 100% | 100% |
100.0% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
97.3% | 100% | 100% | 100% | 100% | 100% | 96% | 100% | 100% | 100% | 100% | 67% | 100% | 96% | 100% | 100% |
67.0% | 25% | 75% | 75% | 100% | 63% | 75% | 72% | 100% | 25% | 75% | 69% | 72% | 53% | 63% | 63% |
95.2% | 100% | 78% | 100% | 100% | 75% | 97% | 100% | 81% | 100% | 100% | 100% | 97% | 100% | 100% | 100% |
93.4% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 38% | 63% | 100% | 100% | 100% |
75.3% | 71% | 100% | 29% | 100% | 17% | 33% | 59% | 71% | 100% | 83% | 100% | 88% | 79% | 100% | 100% |
54.5% | 50% | 44% | 56% | 91% | 22% | 69% | 44% | 44% | 66% | 50% | 25% | 69% | 38% | 78% | 72% |
86.7% | 75% | 100% | 38% | 96% | 100% | 100% | 96% | 100% | 100% | 100% | 42% | 100% | 71% | 100% | 83% |
95.7% | 100% | 100% | 94% | 100% | 100% | 66% | 100% | 91% | 100% | 100% | 91% | 100% | 94% | 100% | 100% |
55.7% | 71% | 75% | 33% | 50% | 42% | 0% | 63% | 59% | 67% | 50% | 46% | 79% | 55% | 71% | 75% |
98.8% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 88% | 94% | 100% |
77.3% | 91% | 91% | 88% | 88% | 88% | 31% | 84% | 63% | 84% | 72% | 78% | 88% | 63% | 81% | 69% |
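
The Score row appears consistent with a plain arithmetic mean of each model's per-prompt coverage, with ranks assigned from highest to lowest mean. The following is a minimal sketch of that calculation, not the dashboard's actual implementation: the function names `model_scores` and `rank_models` are hypothetical, and the inputs are the rounded percentages shown in two of the table's columns, so recomputed values may differ slightly from scores computed on unrounded data.

```python
from statistics import mean

# Per-prompt coverage (%) for two models, copied from the table columns above.
# Only rounded percentages are displayed, so recomputed means are approximate.
coverage = {
    "Claude 3.5 Haiku": [100, 75, 94, 100, 100, 25, 100, 100, 71, 50, 75, 100, 71, 100, 91],
    "Deepseek Chat V3": [100, 94, 100, 100, 100, 100, 100, 100, 100, 91, 96, 100, 50, 100, 88],
}


def model_scores(per_prompt):
    """Return each model's mean coverage across prompts, rounded to 0.1%.

    Assumption: the score is an unweighted mean over prompts.
    """
    return {model: round(mean(values), 1) for model, values in per_prompt.items()}


def rank_models(scores):
    """Return model names ordered from highest to lowest score."""
    return sorted(scores, key=scores.get, reverse=True)


scores = model_scores(coverage)
ranking = rank_models(scores)
print(scores)
print(ranking)
```

For these two columns the recomputed means match the Score row (83.5% and 94.6%). The same unweighted mean taken across a row reproduces the per-prompt average in the first cell.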