Rock Paper Scissors Match #233

Statistical Tie

This match is considered a tie even though the scores differ by 12 points. With 88 decisive rounds (Rounds not ending in a tie), that gap is not large enough to be statistically significant at 90 % confidence.

At this sample size, any difference below 12.0 points can still be explained by random chance rather than player skill.

Move Distribution

Analysis of move choices by each player

Llama 3.0 70B (8192)

Rock 10
9.3%
Paper 3
2.8%
Scissors 95
88.0%

GPT-3.5 turbo (0125)

Rock 48
44.4%
Paper 38
35.2%
Scissors 22
20.4%

Strategy Analysis

Performance insights from the match

Win Streaks

Llama 3.0 70B (8192)
4
consecutive wins
GPT-3.5 turbo (0125)
4
consecutive wins

Strategic Insights

This match ended in a tie, with both models demonstrating equally effective strategies. The distribution of moves suggests a balanced approach from both players. The low tie rate (19.3%) indicates that the models were using distinctly different strategies, rarely making the same move.

Cumulative Wins

Win progress throughout the match

Llama 3.0 70B (8192)
GPT-3.5 turbo (0125)

Win Percentage Over Time

Win rate progression through rounds

Llama 3.0 70B (8192)
GPT-3.5 turbo (0125)

Round-by-Round Results

# P1 P2 Result
1
P2
2
P2
3
Tie
4
Tie
5
P1
6
P2
7
Tie
8
P1
9
P1
10
P1
11
P2
12
P1
13
P1
14
P2
15
P1
16
P2
17
P1
18
Tie
19
P1
20
P1
21
Tie
22
P2
23
P1
24
P2
25
P2
26
Tie
27
P2
28
P2
29
P1
30
P2
31
P2
32
P1
33
P1
34
Tie
35
P2
36
P1
37
Tie
38
P1
39
P2
40
P2
41
P2
42
P2
43
Tie
44
P2
45
P2
46
P2
47
P2
48
P1
49
P1
50
Tie
51
P2
52
P1
53
Tie
54
P2
55
P2
56
P2
57
P2
58
P1
59
Tie
60
P1
61
P1
62
P1
63
P1
64
P2
65
P2
66
P2
67
P1
68
P2
69
P1
70
P2
71
P2
72
P2
73
Tie
74
Tie
75
P1
76
P2
77
P1
78
P1
79
Tie
80
Tie
81
P2
82
Tie
83
P1
84
P1
85
P2
86
P2
87
P2
88
Tie
89
P1
90
P1
91
P2
92
P1
93
P2
94
P1
95
P2
96
P2
97
P2
98
P1
99
P1
100
P2
101
Tie
102
P2
103
P2
104
P2
105
P2
106
Tie
107
P1
108
P2

Similar Matches