INDEX
Explanations
species names and their classifications
New Auto-Interp
Negative Logits
+:+
-0.55
–
-0.52
-
-0.49
standard
-0.48
tambor
-0.48
,
-0.47
R
-0.47
-0.46
/
-0.46
(
-0.45
POSITIVE LOGITS
Diwedd
0.88
насељу
0.79
Però
0.78
utives
0.77
myſelf
0.77
itſelf
0.74
Majefty
0.73
Efq
0.73
ſtate
0.72
<>",
0.72
Activations Density 0.349%