INDEX
Explanations
mathematical terms and symbols related to equations
\mathrm{...} math notation
New Auto-Interp
Negative Logits
,
-0.87
.
-0.86
to
-0.82
-
-0.81
in
-0.79
ent
-0.79
to
-0.78
att
-0.77
al
-0.76
-0.75
POSITIVE LOGITS
ainfi
0.97
desmotivaciones
0.94
indígen
0.93
plufieurs
0.93
queſta
0.85
Verſ
0.84
increí
0.84
pérd
0.83
enfans
0.83
zijne
0.83
Activations Density 0.008%