INDEX
Explanations
the presence of zeros and numerical values in the text
Negative Logits
-0.95  myſelf
-0.94  dafx
-0.93  DockStyle
-0.93  Monfieur
-0.92  Jefus
-0.91  reaſon
-0.90
-0.89  ^(@)
-0.88  itſelf
-0.83  Anſ
Positive Logits
0.67  </blockquote>
0.59  doi
0.53
0.52  ni
0.51  tek
0.51  https
0.51  http
0.51  те
0.50  ?
0.50  acaktır
Activations Density 0.232%