INDEX
Explanations
information about where people live
phrases that indicate where people reside
New Auto-Interp
Negative Logits
umbs
-0.79
elight
-0.79
ery
-0.76
oug
-0.75
xual
-0.73
anche
-0.71
ion
-0.70
aptic
-0.66
ional
-0.65
sonian
-0.65
POSITIVE LOGITS
upstairs
0.92
vic
0.87
downstairs
0.85
lihood
0.81
chool
0.78
stead
0.77
indoors
0.75
abroad
0.74
paycheck
0.74
peacefully
0.71
Activations Density 0.036%