INDEX
Explanations
sentences expressing opinions or conclusions
punctuation marks and their usage within sentences
Negative Logits
adal       -0.77
looted     -0.75
hatch      -0.70
formally   -0.69
acco       -0.68
undet      -0.66
ozyg       -0.65
spor       -0.65
purch      -0.64
everal     -0.64
Positive Logits
Sounds      1.10
Doesn       1.10
That        1.09
Otherwise   1.09
Maybe       1.07
Yeah        1.07
Obviously   1.05
Especially  1.05
Wouldn      1.05
And         1.04
Activations Density: 0.636%