INDEX
Explanations
roles and titles associated with positions of authority or governance
New Auto-Interp
Negative Logits
alone
-0.16
edor
-0.14
.vn
-0.14
which
-0.14
άλ
-0.14
βε
-0.13
ständ
-0.13
گست
-0.13
oro
-0.13
itself
-0.13
POSITIVE LOGITS
/
0.21
åħ¼
0.21
Emer
0.20
–
0.19
-
0.19
&
0.18
cum
0.17
cum
0.17
/System
0.16
&
0.15
Activations Density 0.079%