INDEX
Explanations
words related to hindering or preventing something
the word "deter" in various contexts
Negative Logits
ammy          -0.78
olini         -0.78
ocene         -0.75
halls         -0.73
rooft         -0.67
hetti         -0.65
ioch          -0.64
ocalypse      -0.64
appointments  -0.62
openings      -0.60
Positive Logits
ministic  1.84
minist    1.50
rence     1.05
gent      1.00
ior       0.96
ring      0.94
ency      0.87
rer       0.87
red       0.86
deter     0.85
Activations Density 0.018%