INDEX
Explanations
prepositions and phrases indicating location and time
Negative Logits
shr      -0.15
antan    -0.14
uras     -0.13
-mar     -0.13
argo     -0.13
Measure  -0.13
UDA      -0.13
warm     -0.13
ho       -0.13
ONTAL    -0.13
Positive Logits
herits   0.17
ạn       0.16
onis     0.15
Fon      0.15
ipse     0.15
tractor  0.14
Ñıн      0.14
anes     0.14
inges    0.14
mobx     0.14
Activations Density: 0.662%