INDEX
Explanations
words related to actions and initiatives towards creating change or solving problems
phrases related to actions or plans aimed at improvement and assistance
New Auto-Interp
Negative Logits
Typ
-0.86
Printed
-0.84
Writ
-0.79
pict
-0.74
Strange
-0.73
prints
-0.72
videos
-0.70
Proud
-0.70
Kubrick
-0.70
Sour
-0.69
POSITIVE LOGITS
improve
1.66
mitigate
1.65
reduce
1.64
alleviate
1.60
strengthen
1.56
stabilize
1.55
avert
1.54
curb
1.52
stimulate
1.50
lessen
1.47
Activations Density 0.286%