INDEX
Explanations
proper nouns associated with leadership or prominent figures
New Auto-Interp
Negative Logits
Wolver
-0.15
odox
-0.15
Blackburn
-0.14
azon
-0.14
ÄįÃŃ
-0.14
acco
-0.14
izza
-0.14
uard
-0.14
ió
-0.14
gress
-0.14
POSITIVE LOGITS
·
0.15
ĴĪ
0.14
zÄħd
0.14
Familie
0.14
ød
0.14
Andersen
0.14
ibilidade
0.13
capacit
0.13
rawer
0.13
ulet
0.13
Activations Density 0.363%