Explanations
expressions of regret or acknowledgment of previous mistakes
Negative Logits
Reſ         -1.28
Anſ         -1.28
Houſe       -1.20
itſelf      -1.19
―――――       -1.18
faſt        -1.18
houſe       -1.17
Theſe       -1.17
pleaſure    -1.16
Efq         -1.11
Positive Logits
noted       1.15
notes       1.13
notable     1.09
Note        1.01
note        1.00
Notable     0.97
Notes       0.91
Note        0.89
Notable     0.85
note        0.79
Activations Density 0.143%