INDEX
Explanations
references to deportation and displacement of individuals, particularly in the context of political or humanitarian crises
New Auto-Interp
Negative Logits
clang
-0.16
McDon
-0.15
busy
-0.15
bast
-0.14
ackle
-0.14
491
-0.14
appa
-0.14
cow
-0.14
edis
-0.13
velle
-0.13
POSITIVE LOGITS
suspected
0.23
Err
0.18
thousands
0.18
captured
0.18
hundreds
0.18
err
0.17
citizens
0.17
refugees
0.17
Nationals
0.17
diss
0.17
Activations Density 0.154%