INDEX
Explanations
words expressing personal opinions or perspectives
occurrences of the pronoun "I"
Negative Logits
otherwise      -0.66
sustain        -0.64
ens            -0.62
equals         -0.58
equal          -0.57
aver           -0.56
equ            -0.56
his            -0.55
apes           -0.54
established    -0.54
Positive Logits
I              2.97
My             1.73
IAS            1.53
Is             1.43
IU             1.40
Honestly       1.39
II             1.38
I              1.36
We             1.35
Personally     1.34
Activations Density: 0.076%