Explanations
terms related to genetics and ethical considerations
Negative Logits
yet
-0.24
Yet
-0.23
Yet
-0.22
however
-0.20
HOWEVER
-0.20
yet
-0.20
ONLY
-0.18
However
-0.17
fi
-0.17
Though
-0.16
Positive Logits
sino
0.36
بلکه
0.34
sondern
0.32
nor
0.27
also
0.23
also
0.21
Nor
0.20
，也
0.19
Nor
0.19
，而且
0.19
Activations Density 0.067%