Explanations
instances of quotation marks or related punctuation in the text
Negative Logits
Token      Logit
ůl         -0.16
ランド     -0.15
ospel      -0.15
ially      -0.15
.dm        -0.15
igans      -0.15
ibling     -0.15
eker       -0.14
ẩm         -0.14
úsqueda    -0.14
Positive Logits
Token      Logit
Clim       0.15
749        0.14
itu        0.14
Penn       0.14
side       0.14
fa         0.14
un         0.13
in         0.13
WithValue  0.13
inus       0.13
Activations Density 0.033%