INDEX
Explanations
sentences ending with a period
punctuation, specifically periods indicating the end of sentences
New Auto-Interp
Negative Logits
veter
-0.77
!'
-0.71
mathemat
-0.67
preval
-0.66
reluct
-0.66
glim
-0.65
tremend
-0.65
unbeliev
-0.64
millenn
-0.63
unsus
-0.61
POSITIVE LOGITS
"'
1.78
"â̦
1.70
"(
1.68
"[
1.67
"
1.66
"...
1.65
".
1.44
""
1.30
"#
1.25
"@
1.22
Activations Density 0.165%