INDEX
Explanations
negations and contractions
New Auto-Interp
Negative Logits
ätt
-0.16
unal
-0.15
incr
-0.15
lew
-0.15
ë»
-0.15
nez
-0.14
Firmware
-0.14
unden
-0.14
forgettable
-0.14
realpath
-0.14
POSITIVE LOGITS
sure
0.30
sure
0.24
alone
0.24
phased
0.23
anymore
0.21
Sure
0.21
Sure
0.20
allowed
0.20
finished
0.19
nearly
0.19
Activations Density 0.131%