INDEX
Explanations
phrases and terms related to governance and political accountability
New Auto-Interp
Negative Logits
ĥ½
-0.16
bÃło
-0.15
æĶ»
-0.15
лл
-0.14
iske
-0.14
à¸Ħรà¸ļ
-0.14
à¥ģब
-0.14
pesan
-0.14
avan
-0.14
_lista
-0.14
POSITIVE LOGITS
after
0.22
already
0.19
AFTER
0.17
after
0.16
immediately
0.16
Coder
0.16
right
0.15
inde
0.15
después
0.15
apr
0.15
Activations Density 0.013%