INDEX
Explanations
references to governmental policies and political parties
New Auto-Interp
Negative Logits
nationwide
-0.15
ystick
-0.15
ONGO
-0.15
anmar
-0.15
entityId
-0.14
Nationwide
-0.14
ophil
-0.14
imper
-0.14
national
-0.14
ahrung
-0.14
POSITIVE LOGITS
dev
0.29
Assembly
0.23
Assembly
0.21
Dev
0.20
dev
0.20
Dev
0.20
-dev
0.19
assembly
0.18
DEV
0.17
esome
0.17
Activations Density 0.015%