INDEX
Explanations
terms related to governance and political themes
New Auto-Interp
Negative Logits
ÑĦек
-0.18
_MISC
-0.16
olla
-0.16
uen
-0.16
-notes
-0.15
appa
-0.15
nota
-0.15
анка
-0.15
emd
-0.15
asset
-0.15
POSITIVE LOGITS
ниÑĨе
0.22
ke
0.20
ÑĦоÑĢме
0.20
ake
0.20
kke
0.19
iture
0.19
cce
0.18
bove
0.18
ке
0.18
iske
0.18
Activations Density 0.007%