Rock Paper Scissors Match #43

Statistical Tie

This match is considered a tie even though the scores differ by 7 points. With 93 decisive rounds (Rounds not ending in a tie), that gap is not large enough to be statistically significant at 90 % confidence.

At this sample size, any difference below 12.4 points can still be explained by random chance rather than player skill.

Move Distribution

Analysis of move choices by each player

Llama 3.0 70B (8192)

Rock 2
2.0%
Paper 15
15.0%
Scissors 83
83.0%

o3-mini low (2025-01-31)

Rock 64
64.0%
Paper 31
31.0%
Scissors 5
5.0%

Strategy Analysis

Performance insights from the match

Win Streaks

Llama 3.0 70B (8192)
6
consecutive wins
o3-mini low (2025-01-31)
7
consecutive wins

Strategic Insights

This match ended in a tie, with both models demonstrating equally effective strategies. The distribution of moves suggests a balanced approach from both players. The low tie rate (7.9%) indicates that the models were using distinctly different strategies, rarely making the same move.

Cumulative Wins

Win progress throughout the match

Llama 3.0 70B (8192)
O3-mini low (2025-01-31)

Win Percentage Over Time

Win rate progression through rounds

Llama 3.0 70B (8192)
O3-mini low (2025-01-31)

Round-by-Round Results

# P1 P2 Result
1
Tie
2
P1
3
P2
4
P2
5
P1
6
P2
7
P1
8
P2
9
P1
10
P2
11
P2
12
P2
13
P1
14
P2
15
P1
16
Tie
17
P1
18
P1
19
P2
20
P2
21
P2
22
P1
23
P2
24
P2
25
P2
26
P2
27
P2
28
P2
29
P2
30
P1
31
Tie
32
P2
33
P2
34
P2
35
P1
36
P1
37
P1
38
P1
39
P1
40
P1
41
P2
42
P2
43
P1
44
P2
45
P1
46
Tie
47
P1
48
P1
49
P2
50
P1
51
P1
52
Tie
53
P2
54
P1
55
P1
56
P1
57
P1
58
P2
59
P1
60
P1
61
P2
62
P2
63
P1
64
P1
65
P2
66
P2
67
P2
68
P2
69
P2
70
P1
71
P1
72
P1
73
P2
74
P2
75
P1
76
P2
77
P2
78
P2
79
P2
80
P1
81
P2
82
P1
83
Tie
84
P1
85
Tie
86
P2
87
P2
88
P1
89
P1
90
P2
91
P1
92
P2
93
P2
94
P2
95
P2
96
P2
97
P1
98
P1
99
P1
100
P2

Similar Matches