INDEX
Explanations
the pronoun "his" in various contexts, indicating a focus on masculinity or ownership
New Auto-Interp
Negative Logits
orde
-0.19
inu
-0.16
uce
-0.15
eer
-0.15
ea
-0.14
ResourceId
-0.14
quat
-0.14
oa
-0.14
å¾
-0.14
kode
-0.14
POSITIVE LOGITS
panic
0.24
Majesty
0.23
pter
0.22
peria
0.20
sss
0.18
Excell
0.18
agram
0.17
isnan
0.17
maj
0.16
Maj
0.16
Activations Density 0.069%