Explanations
special characters, particularly various forms of the © symbol and accents
Negative Logits
RegressionTest  -1.21
GEBURTSDATUM    -1.13
preſent         -1.01
purpoſe         -1.00
ſy              -1.00
ſtate           -0.99
prefent         -0.98
fevere          -0.98
propOrder       -0.96
uſe             -0.94
Positive Logits
</em>           0.98
</i>            0.89
s               0.81
}}$             0.78
</sub>          0.78
</strong>       0.76
}}              0.71
</b>            0.70
</sup>          0.70
[toxicity=0]    0.69
Activations Density: 0.054%