INDEX
Explanations
instances of refusal or obstinance in various contexts
New Auto-Interp
Negative Logits
SafeMath
-0.46
ModelAndView
-0.40
Portail
-0.40
sizeCache
-0.39
MathML
-0.39
timing
-0.39
Timing
-0.38
tartalomajánló
-0.38
คม
-0.37
cessite
-0.36
POSITIVE LOGITS
refuses
0.76
不肯
0.75
refused
0.73
refusing
0.70
refusal
0.68
insists
0.65
insisted
0.65
refuse
0.64
Refuse
0.63
stubborn
0.63
Activations Density 0.213%