Explanations
elements related to humanitarian crises and international conflicts
Negative Logits
hus      -0.16
strup    -0.15
sticks   -0.15
976      -0.14
pole     -0.14
ulas     -0.14
kits     -0.14
lom      -0.14
esis     -0.14
585      -0.14
Positive Logits
Afghanistan   0.32
Iraq          0.29
Syria         0.29
conflicts     0.24
Gaza          0.23
Yemen         0.23
Iraq          0.23
Ukraine       0.22
Afghan        0.21
Af            0.21
Activations Density 0.172%