INDEX
Explanations
actions or changes related to physical movement or adjustments
New Auto-Interp
Negative Logits
translation
-0.14
plits
-0.14
asil
-0.14
onds
-0.14
пÑĤом
-0.13
à¸ģรรม
-0.13
enna
-0.13
378
-0.13
elas
-0.13
ras
-0.12
POSITIVE LOGITS
-in
0.51
-In
0.34
inn
0.33
-IN
0.32
into
0.32
-i
0.30
-ins
0.30
ins
0.30
ин
0.28
-ln
0.27
Activations Density 0.087%