INDEX
Explanations
text related to political matters, particularly discussions about democracy, investigations, political ideologies, and governmental actions
New Auto-Interp
Negative Logits
llor
-0.67
killed
-0.64
sacked
-0.63
awei
-0.61
wine
-0.60
CN
-0.60
interrupted
-0.57
HAEL
-0.57
shalt
-0.57
lodged
-0.56
POSITIVE LOGITS
roads
1.19
regards
1.19
clusions
1.16
spite
1.14
accordance
1.07
lieu
1.07
efficiency
1.05
versions
1.04
relation
1.03
regard
1.02
Activations Density 6.963%