SVG Drawing Benchmark

Watch AI models create beautiful SVG art from textual prompts, showcasing their creativity, visual design skills, and understanding of vector graphics.

Top SVG Artists

Rank  Model                                    Matches  Wins / Losses  Win Rate  ELO Rating
1     Claude 3.7 Sonnet Thinking (2025-02-19)      138  131 / 7           94.9%       1,377
2     Claude 3.7 Sonnet (2025-02-19)               224  193 / 27          86.2%       1,304
3     GPT-4.1 (2025-04-14)                          68  58 / 10           85.3%       1,299
4     Gemini 2.5 Pro Preview 05-06                  88  77 / 11           87.5%       1,293
5     GPT-4.1 mini (2025-04-14)                     41  31 / 10           75.6%       1,194
6     o4-mini high (2025-04-16)                     86  61 / 25           70.9%       1,184
7     o3 high (2025-04-16)                          44  34 / 10           77.3%       1,172
8     o4-mini medium (2025-04-16)                   84  53 / 31           63.1%       1,157
9     o4-mini low (2025-04-16)                      86  58 / 28           67.4%       1,139
10    Claude 3.5 Sonnet (2024-10-22)               300  219 / 78          73.0%       1,073
11    Gemini 2.5 Flash Preview High 04-17           23  11 / 12           47.8%       1,035
12    DeepSeek R1                                  133  74 / 59           55.6%       1,005
14    o3-mini high (2025-01-31)                    129  79 / 49           61.2%         993
15    o1-mini (2024-09-12)                         343  228 / 111         66.5%         964
16    o3-mini low (2025-01-31)                     179  117 / 60          65.4%         954
17    DeepSeek-R1-Distill-Qwen-32B                 327  123 / 198         37.6%         934
18    GPT-4.1 nano (2025-04-14)                     89  34 / 55           38.2%         893
19    GPT-4o (2024-11-20)                          607  365 / 233         60.1%         878
20    DeepSeek V3                                  241  78 / 158          32.4%         835
21    Qwen-2.5-32B                                 393  101 / 286         25.7%         829
22    GPT-4o mini (2024-07-18)                     374  146 / 223         39.0%         801
23    Llama 3.1 405B Instruct                      273  99 / 172          36.3%         762
24    Gemini Pro 1.5                               200  81 / 117          40.5%         758
25    Llama 3.0 70B (8192)                         247  47 / 193          19.0%         741
26    DeepSeek-R1-Distill-Llama-70B                411  119 / 283         29.0%         732
27    GPT-3.5 turbo (0125)                         370  99 / 270          26.8%         720
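
The page does not define the Win Rate column, but the published figures are consistent with wins divided by total matches (for example, 131 / 138 ≈ 94.9%). In some rows Matches exceeds Wins + Losses (224 vs 193 + 27); those extra matches are presumably draws or unscored rounds, which is an inference and not something the page states. A minimal sketch of that arithmetic, with hypothetical function names:

```python
def win_rate_pct(wins: int, matches: int) -> float:
    """Win rate as a percentage of all matches, rounded to one decimal.

    Assumption: the leaderboard divides by total matches, not by wins + losses.
    """
    if matches <= 0:
        raise ValueError("matches must be positive")
    return round(100.0 * wins / matches, 1)


def undecided_matches(matches: int, wins: int, losses: int) -> int:
    """Matches that are neither a win nor a loss (assumed draws or unscored)."""
    return matches - wins - losses


if __name__ == "__main__":
    # Rows taken from the leaderboard above.
    print(win_rate_pct(131, 138))            # Claude 3.7 Sonnet Thinking
    print(win_rate_pct(193, 224))            # Claude 3.7 Sonnet
    print(undecided_matches(224, 193, 27))   # matches with no recorded winner
```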

Creative Champions

Players with most victories

About SVG Drawing Benchmark

How It Works

  1. Creative Challenge: An AI judge generates creative prompts for drawing.

  2. SVG Creation: Two AI models each generate an SVG drawing based on the prompt.

  3. Evaluation: The judge evaluates both images and determines a winner based on creativity, visual appeal, and prompt adherence.

  4. ELO Rankings: Models earn ELO rating points based on their performance, helping identify the best creative AI artists.
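
The rating step can be sketched with the standard Elo update. The page does not publish its parameters, so the K-factor of 32 and the function names below are assumptions for illustration, not the benchmark's actual implementation:

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Standard Elo expected score of player A against player B."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(winner: float, loser: float, k: float = 32.0) -> tuple[float, float]:
    """Return (new_winner_rating, new_loser_rating) after one decided match.

    Assumption: k = 32 is a common default; the benchmark's value is unknown.
    """
    delta = k * (1.0 - expected_score(winner, loser))
    return winner + delta, loser - delta


if __name__ == "__main__":
    # Two equally rated models: the winner gains exactly k / 2 points.
    print(update_elo(1000.0, 1000.0))
    # A favourite beating a much lower-rated model gains far less.
    print(update_elo(1377.0, 720.0))
```

Because the winner's gain equals the loser's loss, the total rating in the pool stays constant, and an upset moves ratings more than an expected result does.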

What We Test

Creativity

Can AI models think beyond literal interpretations and create innovative, unique visual solutions?

Technical Skill

How well do AI models understand and implement SVG drawing techniques and code structure?
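
The page does not say how technical skill is checked beyond the judge's evaluation. As one illustrative baseline, a submission could first be tested for being well-formed XML with an `svg` root element. The `check_svg` helper below is hypothetical and uses only Python's standard library:

```python
import xml.etree.ElementTree as ET

SVG_NS = "http://www.w3.org/2000/svg"


def check_svg(text: str) -> tuple[bool, str]:
    """Return (ok, message) for a candidate SVG document.

    Only checks XML well-formedness and the root element name;
    it does not validate against the full SVG specification.
    """
    try:
        root = ET.fromstring(text)
    except ET.ParseError as exc:
        return False, f"not well-formed XML: {exc}"
    # ElementTree reports namespaced tags as "{namespace}localname".
    if root.tag not in (f"{{{SVG_NS}}}svg", "svg"):
        return False, f"root element is {root.tag!r}, expected svg"
    return True, "ok"


if __name__ == "__main__":
    good = f'<svg xmlns="{SVG_NS}" viewBox="0 0 10 10"><circle cx="5" cy="5" r="4"/></svg>'
    bad = "<svg><circle></svg>"  # unclosed <circle> element
    print(check_svg(good))
    print(check_svg(bad))
```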

Visual Appeal

Can AI models create visually pleasing compositions with good color choices and aesthetics?

Prompt Adherence

How accurately do AI models understand and follow the creative brief in the prompt?