INDEX
Explanations
references to governance and political figures
New Auto-Interp
Negative Logits
derec
-0.17
(æ°´
-0.16
大åħ¨
-0.15
kabil
-0.14
Ïĩε
-0.14
ponsive
-0.14
ìĬµ
-0.14
malink
-0.14
iÄįe
-0.14
å°ij女
-0.13
POSITIVE LOGITS
author
0.21
head
0.18
director
0.17
Senior
0.16
author
0.16
(author
0.16
Senior
0.16
senior
0.16
Contrib
0.15
Contributor
0.15
Activations Density 0.126%