INDEX
Explanations
references to positions of authority or leadership roles
New Auto-Interp
Negative Logits
grily
-0.19
ÑĥÑĢÑģ
-0.17
sik
-0.16
eso
-0.15
اÙĨس
-0.15
ought
-0.15
riad
-0.14
δο
-0.14
sak
-0.14
existent
-0.14
POSITIVE LOGITS
person
0.47
persons
0.34
woman
0.32
manship
0.27
PERSON
0.26
mans
0.25
person
0.24
man
0.23
lift
0.22
Person
0.22
Activations Density 0.014%