INDEX
Explanations
instances of the word "never" indicating negation or past denial
Negative Logits
Token                Logit
Muffins              -0.80
stdc                 -0.76
KommentareTeilen     -0.74
:]:                  -0.74
للمعارف              -0.72
DispatchToProps      -0.72
raszam               -0.72
voegd                -0.70
cioso                -0.69
vidia                -0.69
Positive Logits
Token                Logit
Never                1.53
never                1.49
NEVER                1.48
NEVER                1.47
Never                1.46
never                1.42
EVER                 1.16
Nunca                1.16
Nunca                1.10
Ever                 1.09
Activations Density 0.047%
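
As a rough illustration of the density figure above, here is a minimal sketch. It assumes density means the fraction of token positions where the feature's activation is above zero; the page itself does not define it. The function name, the threshold parameter, and the sample data are all hypothetical and were chosen only to reproduce a value of 0.047%.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds threshold.

    Assumption: "density" is the share of tokens on which the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 47 of 100,000 token positions.
sample = [1.2] * 47 + [0.0] * 99_953
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.047%
```

At this density the feature fires on roughly one token in every 2,100, which fits a feature tied to a single word and its close variants.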