INDEX
Explanations
prepositions and directions
prepositions and words indicating relationships or direction
New Auto-Interp
Negative Logits
soever
-0.56
(*
-0.55
@@
-0.49
unwelcome
-0.45
preferred
-0.44
GS
-0.43
UNCLASSIFIED
-0.43
âĢİ
-0.43
desired
-0.43
unemploy
-0.42
POSITIVE LOGITS
cephal
0.59
pless
0.58
ueller
0.56
urnal
0.56
olen
0.53
ovi
0.53
odan
0.52
arton
0.51
ãĥ´
0.51
inav
0.51
Activations Density 1.357%