INDEX
Explanations
comparisons and contrasts between subjects
Negative Logits
AndEndTag          -0.94
ModelExpression    -0.93
myſelf             -0.81
wireType           -0.77
aarrggbb           -0.77
ValueStyle         -0.77
itſelf             -0.77
AssemblyCulture    -0.76
Efq                -0.76
bershka            -0.75
Positive Logits
than               0.52
Pri                0.52
le                 0.51
pri                0.51
den                0.48
a                  0.47
an                 0.47
equivalent         0.45
Den                0.45
du                 0.44
Activations Density 0.279%