Explanations
mentions of urban-related terms or entities
mentions of the word "Urban"
Negative Logits
nesota      -0.82
utes        -0.72
utations    -0.71
tub         -0.70
hered       -0.70
ivity       -0.69
kens        -0.69
onement     -0.65
arers       -0.65
oned        -0.65
Positive Logits
Ñĭ          0.90
owitz       0.74
oscope      0.73
Dictionary  0.73
³           0.71
Moving      0.71
ulus        0.69
atio        0.68
а           0.68
ipal        0.67
Activations Density: 0.031%