INDEX
Explanations
the word "Not" indicating negation
the phrase "Not" followed by various contexts indicating exceptions or disclaimers
New Auto-Interp
Negative Logits
éĥ
-0.76
ç·
-0.76
éģ
-0.72
stakes
-0.71
kamp
-0.71
ãģ®ç
-0.70
æ©
-0.69
creen
-0.69
大
-0.69
Mehran
-0.67
POSITIVE LOGITS
epad
1.21
withstanding
1.16
orious
1.10
icably
1.07
necessarily
1.04
eworthy
0.98
ices
0.93
ific
0.90
ifications
0.89
icia
0.86
Activations Density 0.063%