INDEX
Explanations
occurrences of the pronoun "I" and its variations in dialogue
New Auto-Interp
Negative Logits
ear
-0.17
lid
-0.15
out
-0.14
Hunting
-0.14
afen
-0.14
roc
-0.14
.arc
-0.14
pter
-0.14
esc
-0.14
atar
-0.14
POSITIVE LOGITS
ستÛĮ
0.16
unittest
0.16
ucht
0.15
ì´Į
0.15
ibel
0.15
wicklung
0.15
bens
0.15
nist
0.14
.toolbox
0.14
typeid
0.14
Activations Density 0.304%