INDEX
Explanations
scientific and technical terminology related to genetics and biology
New Auto-Interp
Negative Logits
instead
-0.33
instead
-0.32
Instead
-0.31
Could
-0.29
Instead
-0.27
despite
-0.27
away
-0.26
elsewhere
-0.26
Could
-0.25
via
-0.25
POSITIVE LOGITS
wh
0.17
thro
0.17
fo
0.16
.
0.15
trough
0.15
tha
0.15
t
0.15
ÂĿ
0.14
ing
0.14
st
0.14
Activations Density 0.164%