INDEX
Explanations
references to decisions, actions, or opinions expressed with a negative emotional tone, possibly related to criticism or rejection
comparisons or contrasts between past and present conditions or roles
New Auto-Interp
Negative Logits
verty
-0.68
Led
-0.67
olate
-0.64
aez
-0.62
atures
-0.60
enos
-0.59
soType
-0.58
uben
-0.57
uid
-0.57
ixtures
-0.57
POSITIVE LOGITS
altogether
1.21
entirely
0.94
outright
0.80
attempts
0.73
precon
0.69
nonsense
0.68
pret
0.67
objection
0.67
objections
0.67
attempt
0.67
Activations Density 14.737%