Explanations
instances of being physically confined or restricted
instances of the word "locked" and its variations, indicating confinement or restriction
Negative Logits
exaggeration   -0.72
ahon           -0.70
Interpret      -0.69
aste           -0.68
Footnote       -0.67
plot           -0.66
Insp           -0.65
Cosponsors     -0.65
eness          -0.65
brate          -0.65
Positive Logits
locked         3.45
Locked         2.22
lock           2.06
locked         2.01
locking        2.01
unlocked       1.97
locks          1.91
Lock           1.54
chained        1.51
Lock           1.46
Activations Density: 0.018%