A test is fully passed only if every run passed for that test. Wrong answer: 2. Response time: avg 20.80s, max 88.68s, total 270.46s…
Anti-AI Tricks: 10.0. No failed answers. Response time: avg 7.90s, max 9.52s, total 15.80s.
Coding: 7.0. Wrong answer: 1. Response time: avg 54.28s, max 88.68s, total 108.56s.
Combined: 9.5. No failed answers. Response time: avg 40.61s, max 40.61s, total 40.61s.
Data parsing and extraction: 10.0. No failed answers. Response time: avg 7.72s, max 7.72s, total 7.72s.
Domain specific: 7.7. Wrong answer: 1. Response time: avg 32.73s, max 32.73s, total 32.73s.
General Intelligence: 10.0. No failed answers. Response time: avg 11.77s, max 11.77s, total 11.77s.
Instructions following: 10.0. No failed answers. Response time: avg 9.56s, max 9.56s, total 9.56s.
Puzzle Solving: 10.0. No failed answers. Response time: avg 7.15s, max 8.49s, total 14.30s.
Tool Calling: 10.0. No failed answers. Response time: avg 23.15s, max 23.15s, total 23.15s.
Trivia: 10.0. No failed answers. Response time: avg 6.27s, max 6.27s, total 6.27s.
Wrong answer: 1. Response time: avg 8.30s, max 34.82s, total 165.92s…
Anti-AI Tricks: 10.0. No failed answers. Response time: avg 2.57s, max 3.60s, total 10.27s.
Coding: 10.0. No failed answers. Response time: avg 24.62s, max 34.82s, total 49.24s.
Combined: 10.0. No failed answers. Response time: avg 22.37s, max 22.37s, total 22.37s.
Data parsing and extraction: 10.0. No failed answers. Response time: avg 6.43s, max 8.51s, total 12.87s.
Domain specific: 7.6. Wrong answer: 1. Response time: avg 14.09s, max 22.00s, total 42.27s.
General Intelligence: 10.0. No failed answers. Response time: avg 3.63s, max 3.63s, total 3.63s.
Instructions following: 10.0. No failed answers. Response time: avg 3.35s, max 3.42s, total 6.69s.
Puzzle Solving: 10.0. No failed answers. Response time: avg 3.23s, max 3.68s, total 9.69s.
Tool Calling: 9.8. No failed answers. Response time: avg 4.96s, max 4.96s, total 4.96s.
Trivia: 10.0. No failed answers. Response time: avg 3.94s, max 3.94s, total 3.94s.
Wrong answer: 1. Response time: avg 16.72s, max 117.26s, total 334.36s…
Anti-AI Tricks: 10.0. No failed answers. Response time: avg 3.88s, max 5.73s, total 15.53s.
Coding: 7.9. Wrong answer: 1. Response time: avg 95.96s, max 117.26s, total 191.92s.
Combined: 10.0. No failed answers. Response time: avg 22.42s, max 22.42s, total 22.42s.
Data parsing and extraction: 10.0. No failed answers. Response time: avg 5.43s, max 6.18s, total 10.86s.
Domain specific: 10.0. No failed answers. Response time: avg 15.27s, max 34.09s, total 45.80s.
General Intelligence: 10.0. No failed answers. Response time: avg 5.19s, max 5.19s, total 5.19s.
Instructions following: 10.0. No failed answers. Response time: avg 4.04s, max 4.70s, total 8.08s.
Puzzle Solving: 10.0. No failed answers. Response time: avg 5.48s, max 7.24s, total 16.45s.
Tool Calling: 10.0. No failed answers. Response time: avg 12.60s, max 12.60s, total 12.60s.
Trivia: 10.0. No failed answers. Response time: avg 5.50s, max 5.50s, total 5.50s.
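The pass rule and the three response-time figures reported for each category can be reproduced with a few lines of Python. This is only a minimal sketch under stated assumptions: the function names `fully_passed` and `summarize_times` are hypothetical, the benchmark's real implementation is not shown in the source, and treating a test with no runs as not passed is a choice made here. The two run times in the example are inferred from one reported Anti-AI Tricks row (avg 7.90s, max 9.52s, total 15.80s) on the assumption that the row covers two runs.

```python
from statistics import mean


def fully_passed(run_results):
    """Return True only if every run of a test passed.

    An empty list is treated as not passed (an assumption of this sketch).
    """
    return len(run_results) > 0 and all(run_results)


def summarize_times(times_s):
    """Compute the avg / max / total response time (seconds) for a set of runs."""
    return {
        "avg": round(mean(times_s), 2),
        "max": round(max(times_s), 2),
        "total": round(sum(times_s), 2),
    }


if __name__ == "__main__":
    # Hypothetical per-run pass flags for two tests.
    print(fully_passed([True, True]))
    print(fully_passed([True, False]))

    # Run times inferred from a reported row, assuming two runs.
    print(summarize_times([6.28, 9.52]))
```

Under this reading, a single failed run is enough to count a test as a wrong answer, which is why a category can show a wrong answer while still receiving a fairly high score.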