INDEX
Explanations
instances of conditional or prohibitive statements
New Auto-Interp
Negative Logits
*/
-0.70
س
-0.63
srfAttach
-0.62
thereof
-0.62
itud
-0.60
attRot
-0.59
ibles
-0.58
âĿ
-0.58
"}],"
-0.57
ÙIJ
-0.57
POSITIVE LOGITS
cknowled
0.88
Started
0.81
Own
0.75
itialized
0.71
dating
0.70
nea
0.65
gdala
0.63
iltr
0.62
Tradable
0.61
Fired
0.61
Activations Density 0.143%