INDEX
Explanations
references to the brand Polo, particularly in relation to clothing and sports
New Auto-Interp
Negative Logits
okie
-0.15
quat
-0.14
martial
-0.14
é϶
-0.14
Projectile
-0.14
inja
-0.14
wrestling
-0.14
aina
-0.14
Martial
-0.13
Candle
-0.13
POSITIVE LOGITS
polo
0.41
Polo
0.36
pon
0.26
horses
0.25
pony
0.24
Cart
0.24
Snow
0.22
horse
0.21
Snow
0.21
Pony
0.21
Activations Density 0.001%