INDEX
Explanations
references to personal pronouns or reflexive pronouns
New Auto-Interp
Negative Logits
purpoſe
-1.32
Majefty
-1.26
faſt
-1.21
houſe
-1.18
raiſ
-1.17
pleaſure
-1.14
ſtate
-1.12
Anſ
-1.12
Perſ
-1.11
Houſe
-1.10
POSITIVE LOGITS
se
1.33
Se
0.93
si
0.90
es
0.87
le
0.77
be
0.77
je
0.74
../../
0.72
s
0.71
ab
0.71
Activations Density 0.028%