INDEX
Explanations
special characters and formatting indicators used in documents
New Auto-Interp
Negative Logits
coli
-0.15
anko
-0.14
-density
-0.14
cky
-0.14
fty
-0.14
oller
-0.14
Density
-0.14
Manor
-0.14
bes
-0.13
=en
-0.13
POSITIVE LOGITS
note
0.24
Foot
0.19
-note
0.18
Note
0.17
foot
0.17
Foot
0.16
gang
0.16
Note
0.16
注
0.16
nota
0.16
Activations Density 0.010%