INDEX
Explanations
references to artists, their works, and historical figures in various contexts
New Auto-Interp
Negative Logits
vier
-0.16
Named
-0.14
å°¾
-0.13
Ñijм
-0.13
xaa
-0.13
ÏĢοÏĦε
-0.13
ê³
-0.13
helm
-0.13
fal
-0.13
Parking
-0.12
POSITIVE LOGITS
188
0.23
born
0.23
189
0.21
192
0.20
186
0.20
190
0.20
184
0.19
died
0.18
183
0.18
Born
0.18
Activations Density 0.116%