INDEX
Explanations
words related to physical locations and items with some focus on negative aspects
New Auto-Interp
Negative Logits
utical
-0.66
Flavoring
-0.66
illin
-0.61
iosyncr
-0.58
byter
-0.58
ominated
-0.58
emn
-0.57
compl
-0.57
apego
-0.57
inyl
-0.57
POSITIVE LOGITS
vicinity
0.90
drawer
0.85
freezer
0.81
attic
0.76
aisle
0.76
compartment
0.76
courtyard
0.76
closet
0.75
area
0.74
cage
0.72
Activations Density 0.281%