Explanations
phrases indicating negation
phrases expressing negation or absence of effect
Negative Logits
palms       -0.76
Hok         -0.71
Printed     -0.68
CoC         -0.63
Ages        -0.61
case        -0.61
Seasons     -0.61
boarding    -0.61
hog         -0.61
Handling    -0.59
Positive Logits
ppel        1.08
omsday      1.00
vet         0.98
not         0.94
herty       0.92
pez         0.92
ozy         0.89
oms         0.89
NOT         0.89
exist       0.89
Activations Density 0.096%