INDEX
Explanations
mentions of physical spaces such as closets or drawers
references to storage spaces like closets and drawers
Negative Logits
vati            -0.81
uana            -0.69
iva             -0.68
onso            -0.67
odic            -0.66
idation         -0.66
urai            -0.66
nesty           -0.63
Interstitial    -0.62
GH              -0.62
Positive Logits
closet          1.18
pedia           0.83
shelves         0.79
leans           0.76
ories           0.76
glers           0.74
cleaners        0.73
doors           0.73
rities          0.70
nuts            0.69
Activations Density: 0.016%