INDEX
Explanations
personal declarations or assertions made by the speaker
instances of the pronoun "I" and its variations, indicating a focus on self-reference
New Auto-Interp
Negative Logits
Awesome
-0.75
Multiple
-0.70
multiple
-0.68
CTR
-0.67
Intern
-0.67
Hey
-0.65
packs
-0.65
Looks
-0.64
Awesome
-0.63
Creat
-0.62
POSITIVE LOGITS
confess
1.11
conclude
1.10
admire
1.04
conjecture
1.03
congratulate
1.03
presume
1.02
propose
1.02
rejoice
1.01
suppose
1.01
conceive
1.00
Activations Density 0.200%