Explanations
numerical data or references
Negative Logits
<bos>          -0.85
,:);           -0.83
Personensuche  -0.77
ussis          -0.73
NDEBUG         -0.71
Verso          -0.70
Vea            -0.69
endphp         -0.67
Versace        -0.64
électron       -0.63
Positive Logits
three          1.25
3              1.25
three          1.19
Three          1.14
Three          1.10
THREE          1.08
THREE          1.00
threes         0.98
4              0.97
trois          0.97
Activations Density 0.657%