INDEX
Explanations
punctuation marks and questions in text
New Auto-Interp
Negative Logits
!
-0.58
low
-0.54
ideal
-0.53
Off
-0.52
In
-0.51
yes
-0.51
<h5>
-0.50
!
-0.50
Yes
-0.50
pure
-0.50
POSITIVE LOGITS
Majefty
1.03
Efq
1.03
SourceChecksum
0.98
?!?
0.98
Houſe
0.97
ſelves
0.95
?!?!
0.94
?!"
0.92
Monfieur
0.91
myſelf
0.89
Activations Density 0.196%