INDEX
Explanations
calls to action or requests for public engagement in political matters
New Auto-Interp
Negative Logits
poz
-0.16
isser
-0.15
.setOutput
-0.15
Macy
-0.14
.AutoScale
-0.14
poz
-0.14
ucker
-0.14
baÅŁ
-0.14
izin
-0.13
mediate
-0.13
POSITIVE LOGITS
MP
0.63
MP
0.53
MPs
0.52
mp
0.47
Member
0.44
_MP
0.44
_mp
0.40
Mp
0.40
MPS
0.38
parliament
0.38
Activations Density 0.170%