INDEX
Explanations
personal reflections and insights expressed in the first person singular
first-person pronouns and self-referential statements
Negative Logits
Token         Logit
Delivery      -0.75
Pric          -0.67
Looks         -0.64
Impact        -0.61
totality      -0.60
TBD           -0.60
Deadline      -0.59
optics        -0.59
unspecified   -0.58
farious       -0.58
Positive Logits
Token         Logit
've           1.25
'm            1.20
suppose       1.09
deals         1.08
prefer        1.07
cringe        1.05
ronic         1.05
rarely        1.03
vividly       1.03
am            1.03
Activations Density 0.250%