INDEX
Explanations
punctuation marks with a focus on periods
periods at the end of sentences
New Auto-Interp
Negative Logits
brill
-0.79
nodd
-0.75
abouts
-0.74
gobl
-0.72
ŃĶ
-0.72
tradem
-0.71
babe
-0.70
canv
-0.70
congr
-0.67
enthusi
-0.67
POSITIVE LOGITS
Consequently
1.72
Thus
1.63
Such
1.58
Conversely
1.58
This
1.56
Furthermore
1.55
Moreover
1.53
Hence
1.53
Therefore
1.49
These
1.48
Activations Density 0.389%