INDEX
Explanations
mentions of individuals or groups experiencing displacements due to various circumstances
terms related to displacement and refugees
New Auto-Interp
Negative Logits
ature
-0.81
vern
-0.75
atur
-0.72
acca
-0.71
bara
-0.70
getic
-0.68
ramid
-0.65
osterone
-0.65
ysis
-0.64
tarian
-0.63
POSITIVE LOGITS
refugees
0.96
displaced
0.88
migrants
0.81
refugee
0.79
fleeing
0.77
Refugees
0.77
orphans
0.74
pora
0.71
displacement
0.71
Refugee
0.70
Activations Density 0.031%