INDEX
Explanations
phrases focusing on accountability and responsibility in governance and societal issues
New Auto-Interp
Negative Logits
rather
-0.20
none
-0.17
instead
-0.16
both
-0.16
more
-0.16
anders
-0.16
(
-0.15
ary
-0.15
ima
-0.15
181
-0.14
POSITIVE LOGITS
بÙĦÚ©Ùĩ
0.20
anymore
0.19
à¹ģà¸Ħ
0.18
plusplus
0.18
ä»ħ
0.17
Affected
0.17
LIMITED
0.17
affected
0.17
limited
0.17
sondern
0.16
Activations Density 0.039%