INDEX
Explanations
punctuation marks, particularly question marks and periods, indicating inquiries and statements
Negative Logits
urn     -0.18
erior   -0.17
xn      -0.17
r       -0.15
Dai     -0.14
elly    -0.14
ázev    -0.14
898     -0.14
Bomb    -0.14
Swiss   -0.14
Positive Logits
UILDER  0.16
IRON    0.15
ì´Į     0.15
ì±ħ     0.15
textu   0.15
CEED    0.14
λί      0.14
_qp     0.14
ashing  0.14
ious    0.14
Activations Density 0.182%