INDEX
Explanations
phrases indicating conditions or qualifiers
New Auto-Interp
Negative Logits
iegel
-0.14
tua
-0.13
âĢĮد
-0.13
sice
-0.13
umper
-0.13
_intf
-0.13
ÎķÏĢ
-0.13
ÙĪÙĦÙĬ
-0.12
Handy
-0.12
idan
-0.12
POSITIVE LOGITS
otherwise
0.33
otherwise
0.28
Otherwise
0.27
OTHERWISE
0.25
Otherwise
0.23
-ÑĤо
0.18
thereby
0.18
foy
0.17
Includes
0.17
then
0.17
Activations Density 0.252%