Explanations
words related to suppression, control, and restraint
instances of the word "suppress" and its variations
Negative Logits
sky      -0.73
owitz    -0.72
aldo     -0.70
member   -0.70
gain     -0.69
giving   -0.69
venture  -0.68
Drawn    -0.67
verty    -0.66
¯¯       -0.66
Positive Logits
ively         0.97
suppression   0.91
suppressing   0.86
suppressed    0.86
suppress      0.85
inhib         0.79
muzzle        0.75
impulses      0.75
distractions  0.74
emotions      0.70
Activations Density: 0.026%