INDEX
Explanations
phrases related to prevention or inhibiting actions
instances of the word "prevent."
New Auto-Interp
Negative Logits
ammy
-0.87
eah
-0.68
edded
-0.68
enegger
-0.66
MON
-0.63
inging
-0.62
geist
-0.62
swick
-0.62
Word
-0.61
elt
-0.60
POSITIVE LOGITS
ative
1.04
regress
0.83
detection
0.79
ively
0.78
duplicate
0.78
accidental
0.73
duplication
0.73
pregnancies
0.73
ministic
0.73
reprodu
0.71
Activations Density 0.034%