INDEX
Explanations
adjectives or adverbs indicating a strong degree or intensity
intensifiers or qualifiers that suggest extremity or emphasis
New Auto-Interp
Negative Logits
bow
-0.98
sw
-0.88
res
-0.88
fl
-0.85
rental
-0.85
arm
-0.82
exp
-0.82
rating
-0.81
score
-0.81
written
-0.81
POSITIVE LOGITS
probably
1.95
possibly
1.92
almost
1.87
maybe
1.87
perhaps
1.86
sometimes
1.86
likely
1.81
nothing
1.80
always
1.79
someone
1.78
Activations Density 0.096%