INDEX
Explanations
exclamatory punctuation and decorative symbols
Strings of non-alphanumeric characters
emphasis punctuation
New Auto-Interp
Negative Logits
-0.42
ی
-0.41
}{*}{-0.39
-0.38
typec
-0.38
tableFuture
-0.37
artigian
-0.37
ện
-0.36
light
-0.35
[
-0.35
POSITIVE LOGITS
!!!!!
0.99
!!!!!
0.98
!!!!
0.98
!!!!
0.96
!!!!!!
0.94
!!!"
0.94
?????
0.93
!!!
0.91
!!!!!!!!!
0.90
!!!)
0.90
Activations Density 0.738%