INDEX
Explanations
references to migrants and asylum seekers
New Auto-Interp
Negative Logits
seg
-0.17
Ramp
-0.17
chner
-0.16
abb
-0.15
yme
-0.15
(tol
-0.15
visite
-0.15
.epam
-0.15
øy
-0.14
ỹ
-0.14
POSITIVE LOGITS
crossing
0.40
Crossing
0.35
crossings
0.35
crossed
0.32
-cross
0.31
cross
0.31
Cross
0.29
cross
0.28
_cross
0.27
CROSS
0.27
Activations Density 0.025%