INDEX
Explanations
phrases related to social or political exile
mentions of exile or being exiled
New Auto-Interp
Negative Logits
urgy
-0.85
yth
-0.81
anooga
-0.79
thora
-0.79
umbers
-0.76
ippi
-0.74
icrobial
-0.73
rations
-0.72
amins
-0.71
adders
-0.70
POSITIVE LOGITS
exile
1.28
exiled
0.92
banished
0.89
dissidents
0.86
haun
0.80
abroad
0.74
confinement
0.73
Guant
0.70
persecut
0.70
blogging
0.69
Activations Density 0.011%