Explanations
frequent auxiliary verbs and forms of "to be."
Negative Logits
Bast     -0.15
_cid     -0.14
onte     -0.14
dfd      -0.14
her      -0.14
mens     -0.14
uko      -0.14
subur    -0.13
ici      -0.13
none     -0.13
Positive Logits
utta     0.15
unken    0.15
vais     0.15
egan     0.15
529      0.14
raid     0.14
à¹Īาà¸Ļ   0.14
uper     0.14
Rogue    0.14
éĽ²      0.14
Activations Density: 0.003%