INDEX
Explanations
geographical locations, specifically cities and universities
names of specific locations and terms related to immigration and refugee issues
New Auto-Interp
Negative Logits
Cu
-0.73
buster
-0.68
hij
-0.64
iling
-0.63
telev
-0.62
Detective
-0.62
conglomer
-0.61
filler
-0.61
ijn
-0.60
Lau
-0.60
POSITIVE LOGITS
throp
2.45
Arbor
2.25
Refugees
1.94
kept
1.23
Kabul
1.07
ickr
1.06
uties
1.04
wcsstore
1.02
atche
1.00
humane
0.96
Activations Density 0.049%