INDEX
Explanations
variable assignment statements in a code context
New Auto-Interp
Negative Logits
f
-0.38
-0.38
_
-0.37
.
-0.36
F
-0.36
</
-0.35
OR
-0.35
or
-0.34
r
-0.34
Or
-0.34
POSITIVE LOGITS
={{1.54
Diweddarwch
0.94
Rüyada
0.93
propOrder
0.90
فريبيس
0.90
OGND
0.88
>{{0.86
{{0.83
${{0.81
">{{0.81
Activations Density 0.001%