INDEX
Explanations
phrases that mention physical or digital locks
instances of the word "lock" in various contexts
New Auto-Interp
Negative Logits
ANN
-0.61
orally
-0.59
oun
-0.58
hetical
-0.58
appreci
-0.58
inacc
-0.57
anqu
-0.56
intervening
-0.56
issan
-0.55
cognition
-0.54
POSITIVE LOGITS
lock
1.22
locking
1.00
heed
0.97
lear
0.92
locks
0.91
picking
0.88
er
0.86
eries
0.84
trap
0.81
itus
0.79
Activations Density 0.006%