INDEX
Explanations
punctuation marks and sentence endings
New Auto-Interp
Negative Logits
_EOF
-0.16
amera
-0.14
Craft
-0.14
alette
-0.14
_LANG
-0.14
Filed
-0.13
iras
-0.13
av
-0.13
nailed
-0.13
ÏĦεÏį
-0.12
POSITIVE LOGITS
|
0.34
##
0.27
|
0.21
|"
0.20
اÙĦصÙģ
0.19
.|
0.19
|:
0.18
|/
0.18
|R
0.18
|x
0.17
Activations Density 0.015%