INDEX
Explanations
words and phrases used to add supporting information and provide additional context.
Negative Logits
Logit   Token
-1.13   ―――――
-1.00   $_"
-0.97   doubtnut
-0.94   ་་
-0.92   ――――
-0.91   ――――――――
-0.86   Majefty
-0.85   purpoſe
-0.83   XNUMX
-0.83   becauſe
Positive Logits
Logit   Token
0.92    ↵↵
0.91    '
0.91    (blank in source)
0.87    <bos>
0.86    ‘
0.82    ↵
0.76    (blank in source)
0.73    A
0.73    <eos>
0.70    (blank in source)
Activations Density 2.655%