INDEX
Explanations
personal pronouns and contractions, indicating a focus on individual perspectives and experiences
New Auto-Interp
Negative Logits
‘
-1.53
“
-1.52
’
-1.33
”
-1.28
.’
-1.26
’,
-1.26
’.
-1.25
=”
-1.24
.”
-1.19
,’
-1.13
POSITIVE LOGITS
Jefus
1.38
purpoſe
1.37
poffible
1.30
Efq
1.30
ainfi
1.29
ſtate
1.28
Majefty
1.26
againſt
1.24
houſe
1.23
myſelf
1.22
Activations Density 0.602%