Explanations
questions or statements starting with the word "Where"
the word "Where"
Negative Logits
³³³³³³³³     -0.70
)].          -0.70
roller       -0.61
franc        -0.57
secretion    -0.56
gripping     -0.55
absorbing    -0.54
swipe        -0.54
palp         -0.53
<+           -0.53
Positive Logits
fore         1.49
upon         1.45
abouts       1.43
ver          1.34
soever       1.07
velt         0.78
Dat          0.76
with         0.76
else         0.76
ipl          0.76
Activations Density 0.047%