INDEX
Explanations
prepositions and articles indicating location
Negative Logits
i          -0.16
lek        -0.16
ving       -0.15
ÙĬرة       -0.15
oret       -0.15
les        -0.14
eket       -0.14
alm        -0.14
egie       -0.14
ESSAGES    -0.14
Positive Logits
sein       0.25
detriment  0.22
level      0.21
moment     0.20
-dess      0.19
ÃŁerdem    0.18
cours      0.16
ëłĪ벨      0.16
seins      0.16
fur        0.15
Activations Density 0.006%