INDEX
Explanations
instances of the word "het" and similar pronouns in various contexts
New Auto-Interp
Negative Logits
ello
-0.16
hold
-0.15
ering
-0.15
kt
-0.15
yy
-0.14
ITY
-0.14
ãĤ»ãĥ³
-0.14
whelming
-0.14
hed
-0.14
thic
-0.14
POSITIVE LOGITS
âĹĦ
0.20
ovah
0.15
alem
0.15
utsch
0.14
retch
0.14
ioni
0.14
pii
0.14
feof
0.14
à¹Īà¸ĩ
0.14
getc
0.14
Activations Density 0.017%