INDEX
Explanations
questions and phrases indicating conditions or requirements
New Auto-Interp
Negative Logits
vi
-0.16
activex
-0.15
ago
-0.15
ickle
-0.15
apolis
-0.15
hide
-0.14
æĹĹ
-0.14
.bn
-0.13
DX
-0.13
DX
-0.13
POSITIVE LOGITS
Welfare
0.16
iniz
0.16
COMPARE
0.15
ecess
0.14
nex
0.14
Interest
0.14
weeted
0.14
matcher
0.14
lub
0.14
(Have
0.14
Activations Density 0.002%