Explanations
contractions and negations
phrases emphasizing conditional situations or statements
Negative Logits
NetMessage       -0.68
nesota           -0.63
onement          -0.63
.............    -0.63
Royale           -0.63
etts             -0.61
Excellence       -0.59
Selection        -0.59
Mobility         -0.58
Shame            -0.57
Positive Logits
technically      0.94
disagree         0.81
disagreed        0.79
trivial          0.78
momentarily      0.77
admittedly       0.73
physically       0.73
slight           0.73
otherwise        0.71
consciously      0.71
Activations Density 0.175%
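
The page does not define the density figure. A minimal sketch, assuming it is the fraction of sampled token positions on which the feature has a nonzero activation; the function name `activation_density` and the sample data are hypothetical and chosen only so the arithmetic reproduces 0.175%.

```python
def activation_density(activations):
    """Return the fraction of positions whose activation is above zero.

    Assumption: "density" means active positions / total positions.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > 0)
    return active / len(activations)


# Hypothetical sample: 7 active positions out of 4000 tokens.
sample = [0.0] * 3993 + [1.2] * 7
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.175%
```

Under this reading, a density of 0.175% means the feature fires on roughly 7 of every 4,000 tokens.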