INDEX
Explanations
mentions of urban development and changes in a neighborhood
the word "are" indicating existence or presence
New Auto-Interp
Negative Logits
iates
-0.63
urry
-0.63
ossom
-0.61
iture
-0.58
ð
-0.56
dismant
-0.56
ises
-0.56
ipop
-0.56
viz
-0.55
osate
-0.55
POSITIVE LOGITS
senal
1.11
wolves
1.04
wolf
0.93
hereby
0.88
nt
0.79
gonna
0.76
supposed
0.72
not
0.72
definitely
0.71
generally
0.70
Activations Density 0.304%