Explanations
mathematical symbols and variables
Negative Logits
(token not captured)    -0.48
#+#                     -0.47
Erstellt                -0.45
Thrust                  -0.43
schirm                  -0.43
ferent                  -0.42
mtliche                 -0.42
amb                     -0.41
DNEY                    -0.40
tanleria                -0.39
Positive Logits
$,      1.16
}$,     1.15
)$,     1.07
}}$,    1.01
}}$,    0.98
]$,     0.97
\}$,    0.91
)}$,    0.90
”,      0.88
》,      0.88
Activations Density 1.750%