INDEX
Explanations
phrases related to confinement or containment
references to confinement and restrictive living spaces
New Auto-Interp
Negative Logits
ILA
-0.71
Flavoring
-0.59
Coverage
-0.59
Vote
-0.58
ophy
-0.58
oice
-0.57
claimants
-0.56
ighth
-0.55
ammers
-0.55
Lauder
-0.55
POSITIVE LOGITS
attic
1.10
courtyard
1.07
surrounded
0.99
closet
0.97
hallway
0.94
basement
0.94
room
0.94
enclosure
0.93
freezer
0.92
aisle
0.90
Activations Density 0.376%