INDEX
Explanations
words related to minimizing, reducing, or preventing
actions or concepts associated with reducing negative impacts or minimizing risks
Negative Logits
Token      Logit
ker        -0.87
enegger    -0.85
king       -0.81
swick      -0.76
otle       -0.75
gob        -0.74
join       -0.73
cart       -0.71
leader     -0.69
worldly    -0.67
Positive Logits
Token          Logit
imize          0.92
minimizing     0.82
minimize       0.80
distractions   0.78
amounts        0.77
maximizing     0.77
minimized      0.71
misunderstand  0.71
utilization    0.71
imal           0.71
Activations Density: 0.034%