INDEX
Explanations
instances of punctuation and rhetorical emphasis in text
Negative Logits
inte      -0.18
ÙĪØ£ÙĨ    -0.17
angling   -0.15
ackage    -0.15
ÑĪÑĮ      -0.14
aniu      -0.14
íĻľ       -0.14
enou      -0.13
umpt      -0.13
igon      -0.13
Positive Logits
IDI        0.14
ambi       0.14
âĨĶ        0.14
EXEMPLARY  0.14
reatest    0.13
rew        0.13
Civic      0.13
ToWorld    0.13
\uD        0.13
edil       0.13
Activations Density 0.377%