Explanations
contractions and words related to decision-making
positive expressions or affirmations about experiences
Negative Logits
But        -0.76
However    -0.73
But        -0.71
ña         -0.70
but        -0.67
However    -0.66
atl        -0.65
afa        -0.64
WATCHED    -0.64
asp        -0.63
Positive Logits
nonetheless     1.80
nevertheless    1.32
etheless        1.12
darn            0.98
awfully         0.93
anyways         0.91
anyway          0.87
certainly       0.85
damn            0.85
gist            0.83
Activations Density 1.014%