INDEX
Explanations
instances of periods followed by quotations or specific sequences of characters, potentially related to coding or formatting
punctuation marks and sentence endings
New Auto-Interp
Negative Logits
Aires
-0.70
favour
-0.68
xual
-0.67
agar
-0.62
POV
-0.61
Venom
-0.61
Princ
-0.61
rumours
-0.60
estates
-0.60
cit
-0.60
POSITIVE LOGITS
SHARE
1.11
Except
1.00
But
0.99
And
0.95
Yet
0.95
Especially
0.91
Then
0.89
Priv
0.89
That
0.89
Repeat
0.89
Activations Density 0.439%