Explanations
phrases related to buildings and their physical features
references to multiple-story buildings
Negative Logits
ateurs      -0.81
ugal        -0.80
ggies       -0.77
istered     -0.77
ournal      -0.74
uters       -0.74
estern      -0.71
ensable     -0.71
ocol        -0.69
eeks        -0.68
Positive Logits
story       1.11
telling     1.02
LIN         0.91
line        0.88
LINE        0.80
lined       0.79
glass       0.79
Stories     0.78
Story       0.77
lines       0.76
Activations Density 0.009%