INDEX
Explanations
words related to decrease or reduced actions or states
terms related to reduction or decline in various contexts
New Auto-Interp
Negative Logits
ansas
-0.72
wered
-0.68
giving
-0.64
oa
-0.63
ocratic
-0.62
Bio
-0.61
Aval
-0.61
Issue
-0.61
types
-0.60
Reviewed
-0.60
POSITIVE LOGITS
decrease
0.85
cember
0.84
reduction
0.75
decreases
0.75
decreasing
0.71
decreased
0.71
uce
0.70
(-
0.70
Decre
0.69
inhibited
0.68
Activations Density 0.021%