INDEX
Explanations
complex mathematical expressions or formatting related to scientific texts
New Auto-Interp
Negative Logits
tfrac
-0.72
>(_
-0.68
disambiguazione
-0.65
@"/
-0.62
bmod
-0.62
ệp
-0.62
AutoresizingMask
-0.61
!("{-0.60
Xna
-0.60
ʺ
-0.59
POSITIVE LOGITS
↵↵↵
0.79
↵↵↵↵↵↵
0.73
↵↵↵↵
0.72
↵↵↵↵↵
0.67
تكبرها
0.66
↵↵↵↵↵↵↵
0.61
Monfieur
0.60
↵↵↵↵↵↵↵↵
0.60
↵↵↵↵↵↵↵↵↵
0.59
\\
0.59
Activations Density 0.003%