INDEX
Explanations
names of famous individuals or characters commonly associated with a specific movie or franchise
characters and names from various forms of entertainment media
New Auto-Interp
Negative Logits
Reviewed
-0.68
mouse
-0.67
ources
-0.64
Krug
-0.64
rooting
-0.62
steer
-0.60
mitt
-0.60
xual
-0.59
orescent
-0.58
[*]
-0.58
POSITIVE LOGITS
etus
0.96
oglu
0.81
nova
0.79
onen
0.79
zona
0.77
esi
0.75
atari
0.73
Lago
0.72
hyde
0.69
ensis
0.67
Activations Density 0.605%