INDEX
Explanations
instances of people in leadership or notable roles
Negative Logits
én        -0.16
wn        -0.15
LP        -0.15
acr       -0.15
Throne    -0.15
oda       -0.15
mdb       -0.14
ounder    -0.14
hm        -0.14
ammers    -0.13
Positive Logits
323       0.15
erdale    0.14
rada      0.14
скор      0.13
τυ        0.13
kyt       0.13
Počet     0.13
Kit       0.13
hangi     0.13
码        0.13
Activations Density 0.043%