INDEX
Explanations
proper nouns related to political figures or entities
New Auto-Interp
Negative Logits
estone
-0.77
lez
-0.73
oor
-0.73
ords
-0.72
estones
-0.70
away
-0.67
abouts
-0.67
zl
-0.66
stone
-0.66
oÄŁ
-0.66
POSITIVE LOGITS
Mike
0.99
Vincent
0.93
Jim
0.92
Denis
0.91
Colin
0.91
Steve
0.90
Philip
0.89
Sue
0.88
Gerard
0.88
Leslie
0.88
Activations Density 0.111%