INDEX
Explanations
actions followed by prepositions
New Auto-Interp
Negative Logits
the
0.67
The
0.52
একজন
0.51
has
0.50
larının
0.49
anunció
0.49
arovski
0.47
reveló
0.47
telah
0.47
ferencia
0.47
POSITIVE LOGITS
grads
0.51
किंतु
0.49
alır
0.46
latter
0.45
plupart
0.45
pptn
0.44
restent
0.43
тощо
0.42
alkalies
0.42
mainstay
0.42
Activations Density 0.046%