INDEX
Explanations
sentences concluding a point or with a strong emphasis
sentences that express finality or conclusions
Negative Logits
itled         -0.67
ancy          -0.64
atorium       -0.62
marked        -0.61
aband         -0.60
aditional     -0.59
replacing     -0.59
assignment    -0.59
first         -0.58
phased        -0.58
Positive Logits
Lastly        1.42
etc           1.38
Finally       1.27
Lastly        1.10
Finally       1.06
etc           1.03
Whatever      0.98
These         0.93
whatever      0.89
These         0.89
Activations Density 0.790%