INDEX
Explanations
indicators of political discourse and conflicts
New Auto-Interp
Head Attr Weights
0:0.02
1:0.03
2:0.04
3:0.06
4:0.21
5:0.03
6:0.02
7:0.37
8:0.03
9:0.03
10:0.07
11:0.05
Negative Logits
thood
-1.93
owment
-1.84
apsed
-1.84
priority
-1.78
pleted
-1.76
fecture
-1.74
itialized
-1.70
igible
-1.69
acht
-1.68
ocamp
-1.65
POSITIVE LOGITS
assertions
2.15
assertion
1.93
baseless
1.90
prevailing
1.89
criticism
1.89
cynicism
1.88
questioning
1.87
criticisms
1.87
contradiction
1.85
accusation
1.84
Activations Density 0.000%