INDEX
Explanations
phrases indicating a negative context or opposition
the word "Not" and its variations in different contexts
New Auto-Interp
Negative Logits
velt
-0.70
stakes
-0.67
hower
-0.61
ç·
-0.60
æĸ
-0.59
Circuit
-0.59
kamp
-0.58
CHR
-0.57
Kry
-0.56
ixel
-0.56
POSITIVE LOGITS
epad
1.33
eworthy
1.16
icably
1.12
withstanding
1.11
icable
1.10
necessarily
1.08
orious
1.08
ifications
1.06
ifier
1.04
ional
1.01
Activations Density 0.102%