INDEX
Explanations
references to days of the week, particularly Saturday
New Auto-Interp
Negative Logits
ible
-0.17
ÑģÑĮ
-0.16
nings
-0.15
erg
-0.15
unny
-0.15
grip
-0.15
alo
-0.14
heit
-0.14
.CopyTo
-0.14
Ñģли
-0.14
POSITIVE LOGITS
ellite
0.21
ellites
0.19
abis
0.17
asso
0.15
bara
0.15
irical
0.15
tement
0.15
ennis
0.15
yr
0.15
rol
0.14
Activations Density 0.014%