INDEX
Explanations
phrases related to threats or potential negative impacts
references to threats or damage to rights and institutions
New Auto-Interp
Negative Logits
ãĤ´ãĥ³
-0.79
omer
-0.73
hots
-0.71
owler
-0.71
oaded
-0.70
placed
-0.69
--+
-0.69
arate
-0.68
atoon
-0.68
tackle
-0.68
POSITIVE LOGITS
integrity
1.53
livelihood
1.48
viability
1.47
credibility
1.31
wellbeing
1.25
lives
1.25
stability
1.24
validity
1.23
reliability
1.20
effectiveness
1.19
Activations Density 0.211%