INDEX
Explanations
negations within texts
the repeated phrase "Not" as a way to introduce negation or contrast in statements
New Auto-Interp
Negative Logits
kamp
-0.73
stakes
-0.72
éĥ
-0.72
ãģ®ç
-0.72
ãģ«
-0.71
ç·
-0.70
creen
-0.70
ä¿
-0.67
velt
-0.66
è£
-0.66
POSITIVE LOGITS
withstanding
1.29
orious
1.15
eworthy
1.08
epad
1.07
icably
1.04
necessarily
1.03
ifications
0.92
icing
0.88
ices
0.88
etheless
0.87
Activations Density 0.073%