INDEX
Explanations
mentions of immigration and immigrants
Negative Logits
Token      Logit
rian       -0.17
ria        -0.16
Hack       -0.16
aska       -0.15
clusive    -0.15
ver        -0.15
ki         -0.14
ger        -0.14
iers       -0.14
double     -0.14
Positive Logits
Token          Logit
LIKELY         0.17
ê              0.15
ceptar         0.15
/english       0.15
bracht         0.15
licken         0.14
iteDatabase    0.14
adil           0.14
rada           0.14
["$            0.14
Activations Density: 0.006%