Explanations
prepositions commonly used in various contexts
Negative Logits
(token not rendered)    -1.35
يتيمه                   -1.22
<unused52>              -1.17
<unused79>              -1.16
<unused68>              -1.16
<unused14>              -1.16
<unused8>               -1.16
<unused16>              -1.16
<unused3>               -1.16
[@BOS@]                 -1.16
Positive Logits
.                       0.94
↵                       0.93
(token not rendered)    0.81
1                       0.77
(                       0.77
0                       0.73
2                       0.72
'                       0.72
↵↵                      0.71
"                       0.68
Activations Density 1.614%