INDEX
Explanations
negations or expressions of contrary statements
New Auto-Interp
Negative Logits
IsEmpty
-0.59
.
-0.57
corresponden
-0.53
is
-0.53
sqcup
-0.53
ishy
-0.51
Silverman
-0.51
Dillon
-0.51
es
-0.50
pretation
-0.50
POSITIVE LOGITS
not
1.46
Not
1.29
Not
1.28
not
1.25
NOT
1.13
Италијани
1.07
IntoConstraints
1.06
NOT
1.04
Niet
1.01
propOrder
1.00
Activations Density 0.155%