INDEX
Explanations
sequences of whitespace or formatting characters
user input with punctuation
Negative Logits
feroit      -0.92
pouvoit     -0.86
auroit      -0.84
ainfi       -0.83
gustaMe     -0.79
avoient     -0.77
ſta         -0.77
Theſe       -0.76
Verſ        -0.75
wikipagina  -0.73
Positive Logits
(blank)     0.87
(blank)     0.76
(blank)     0.66
(blank)     0.63
(blank)     0.58
(blank)     0.57
(blank)     0.57
’           0.56
(blank)     0.55
'           0.55
Activations Density 0.000%