INDEX
Negative Logits
ErrorMessage
-0.08
_confirmation
-0.07
Wallace
-0.07
createClass
-0.07
VV
-0.06
PHA
-0.06
Wellington
-0.06
г
-0.06
recent
-0.06
.ZERO
-0.06
POSITIVE LOGITS
economic
0.06
建立
0.06
剧
0.06
iously
0.06
Benefits
0.05
Russians
0.05
CEOs
0.05
Reduced
0.05
informs
0.05
sick
0.05
Activations Density 0.021%