INDEX
Explanations
affirmative statements or confirmations
New Auto-Interp
Negative Logits
ⓧ
-0.62
yntaxException
-0.60
AppBundle
-0.58
IContainer
-0.56
tagHelperRunner
-0.55
Rhymes
-0.54
DeleteBehavior
-0.53
etera
-0.52
einem
-0.51
insuffisamment
-0.49
POSITIVE LOGITS
False
0.84
False
0.84
believers
0.84
believer
0.79
True
0.76
TRUE
0.73
false
0.71
TRUE
0.70
stdbool
0.67
colors
0.67
Activations Density 0.094%