INDEX
Explanations
numerical values relating to measurements or counts
New Auto-Interp
Negative Logits
ritte
-0.14
jus
-0.14
ron
-0.14
lub
-0.13
fait
-0.13
å
-0.13
eten
-0.13
argo
-0.13
sel
-0.13
.into
-0.13
POSITIVE LOGITS
kea
0.19
herits
0.15
berman
0.15
st
0.15
\views
0.15
amarin
0.14
cho
0.14
quo
0.14
è´«
0.14
seiz
0.13
Activations Density 0.060%