Explanations
phrases indicating uncertainty or negation regarding existence or quality
Negative Logits
Token        Logit
aper         -0.17
etch         -0.16
leo          -0.16
nackte       -0.15
ouv          -0.15
avn          -0.14
McCabe       -0.14
ền           -0.13
variants     -0.13
Stateless    -0.13
Positive Logits
Token          Logit
chances        0.28
need           0.27
possibility    0.26
chance         0.23
probability    0.21
likelihood     0.20
need           0.19
Need           0.19
possibilities  0.19
Probability    0.18
Activations Density: 0.064%