INDEX
Explanations
negations and expressions of inability or non-existence
New Auto-Interp
Negative Logits
adpleegd
-0.78
setSource
-0.76
Thales
-0.75
merce
-0.75
Weiss
-0.75
Gibbs
-0.74
PACE
-0.73
_('-0.71
ebe
-0.71
Meyer
-0.71
POSITIVE LOGITS
isn
1.28
__":
1.26
wasn
1.22
weren
1.20
Wasn
1.19
didn
1.16
Isn
1.15
aren
1.15
__':
1.14
mustn
1.14
Activations Density 0.076%