INDEX
Explanations
the name "Guido" with varying degrees of activation
the occurrences of the name "Guido"
New Auto-Interp
Negative Logits
REE
-0.79
Asheville
-0.74
rals
-0.69
Atmospheric
-0.69
ardless
-0.69
lees
-0.67
ILCS
-0.65
д
-0.64
birth
-0.64
Canadians
-0.63
POSITIVE LOGITS
ido
0.90
estro
0.90
odo
0.86
utsche
0.84
jit
0.83
ctor
0.81
atri
0.79
zilla
0.77
emonium
0.75
roid
0.75
Activations Density 0.007%