INDEX
Explanations
locations within buildings
references to locations or places
New Auto-Interp
Negative Logits
oneself
-0.63
yourselves
-0.61
annot
-0.60
anson
-0.60
backlash
-0.59
laus
-0.58
phabet
-0.57
IOC
-0.56
trending
-0.56
Kraft
-0.56
POSITIVE LOGITS
overlooking
0.92
mates
0.86
balcony
0.85
apartment
0.84
mate
0.84
lair
0.84
doorstep
0.84
tips
0.82
porch
0.80
enclosure
0.79
Activations Density 0.132%