INDEX
Explanations
occurrences of various forms of the word "as."
New Auto-Interp
Negative Logits
i
-0.22
tm
-0.21
kal
-0.20
yar
-0.20
han
-0.19
ÛĮ
-0.19
eing
-0.19
ká
-0.19
hum
-0.19
hu
-0.18
POSITIVE LOGITS
aurus
0.19
phalt
0.18
sembler
0.17
ãĤ±ãĥĥãĥĪ
0.17
nost
0.17
íĭ±
0.17
hton
0.16
ional
0.16
fak
0.16
gow
0.16
Activations Density 0.102%