INDEX
Explanations
prepositions and their usage within context
Negative Logits
ndx     -0.15
Americ  -0.15
anger   -0.14
belang  -0.14
оÑĢож    -0.14
менÑĪ   -0.14
arefa   -0.14
itize   -0.14
attery  -0.14
nat     -0.14
Positive Logits
onu            0.15
sett           0.15
jax            0.14
ile            0.13
-transitional  0.13
odos           0.13
има            0.13
help           0.13
iled           0.13
keh            0.13
Activations Density 0.280%