Explanations
LaTeX formatting elements and structures used in mathematical expressions
Negative Logits
FetchType     -0.80
mente         -0.71
orsz          -0.67
Wię           -0.64
UIButton      -0.63
ANIM          -0.63
ázaro         -0.62
nav           -0.61
Παραπομπές    -0.61
阅读          -0.60
Positive Logits
↵             1.11
↵↵            0.91
              0.90
}{*}{         0.90
              0.89
              0.84
              0.83
              0.83
[toxicity=0]  0.82
              0.82
Activations Density 0.020%