INDEX
Explanations
actions related to movement and direction
New Auto-Interp
Negative Logits
Personensuche
-0.63
lismo
-0.51
muna
-0.47
vertre
-0.46
criteria
-0.45
ثیر
-0.45
["",
-0.44
mourut
-0.44
aryti
-0.43
fossa
-0.42
POSITIVE LOGITS
towards
1.26
toward
1.23
towards
1.21
toward
1.18
Towards
1.03
Toward
1.02
Towards
0.94
TOW
0.85
Toward
0.84
menuju
0.75
Activations Density 0.119%