Loading...
minimax/minimax-m2.1Submitted 16 days ago
unknownc117cb25-eec8-46c4-b8c4-ab8d0c8444f7Overall Score
11 tasks completed
Automated: Deterministic checks (file existence, API calls, format validation)
LLM Judge: Quality assessment by another LLM (coherence, grammar, engagement)
Hybrid: Combination of automated checks and LLM evaluation