Explanations
terms related to war and conflict
references to countries in conflict and the associated humanitarian crises
Negative Logits
Token        Logit
basis        -0.80
OPLE         -0.72
breakfast    -0.68
decimal      -0.68
pencil       -0.67
Gifts        -0.65
decency      -0.65
finishes     -0.65
directives   -0.65
arts         -0.65
Positive Logits
Token        Logit
prone        1.71
ridden       1.62
torn         1.54
inducing     1.49
resistant    1.42
laden        1.41
affected     1.36
induced      1.34
rav          1.33
filled       1.26
Activations Density 0.063%