INDEX
Explanations
mass mobilization, deportation, surveillance, death, nationalism, atrocities
New Auto-Interp
Negative Logits
inscri
1.41
defini
1.38
wykon
1.26
obten
1.25
adore
1.23
adiab
1.23
indique
1.22
esegu
1.20
ordine
1.20
indiqu
1.18
POSITIVE LOGITS
t
2.44
ми
1.60
uk
1.46
the
1.45
ä
1.42
tr
1.24
ni
1.23
á
1.23
z
1.21
д
1.20
Activations Density 0.004%