INDEX
Explanations
past tense verbs and references to familiarity
New Auto-Interp
Negative Logits
orch
-0.15
ноп
-0.15
Giz
-0.15
nech
-0.14
onta
-0.14
šk
-0.14
بد
-0.14
ANEL
-0.14
ivery
-0.14
611
-0.14
POSITIVE LOGITS
ια
0.15
only
0.14
æłª
0.14
ienza
0.14
ither
0.14
.datas
0.14
eper
0.13
hôm
0.13
empl
0.13
ilm
0.13
Activations Density 0.005%