INDEX
Explanations
punctuation marks, especially periods
sentence-ending punctuation, specifically focusing on periods and commas
New Auto-Interp
Negative Logits
proport
-0.82
oun
-0.78
landfall
-0.78
commer
-0.74
afterlife
-0.73
warr
-0.73
isphere
-0.72
consolidation
-0.69
undermin
-0.68
casualty
-0.68
POSITIVE LOGITS
Then
0.99
Later
0.90
Exactly
0.90
Asked
0.90
"_
0.87
Something
0.86
org
0.85
********************************
0.85
Suddenly
0.84
Yeah
0.84
Activations Density 0.168%