INDEX
Explanations
phrases related to physical locations, specifically basements
references to basements or cellar spaces
Negative Logits
Token      Logit
nir        -0.78
acting     -0.78
utan       -0.77
Cola       -0.76
cific      -0.72
uncture    -0.71
itia       -0.71
uality     -0.70
HCR        -0.68
acted      -0.68
Positive Logits
Token      Logit
basement   1.14
stairs     0.94
lid        0.91
cellar     0.86
hatch      0.86
Dwell      0.85
courtyard  0.85
attic      0.84
closet     0.82
bedroom    0.80
Activations Density 0.009%