INDEX
Explanations
direct speech or quotations
New Auto-Interp
Negative Logits
EconPapers
-1.14
―――――
-1.06
ſelves
-1.04
$_"
-1.02
verwijspagina
-1.02
itſelf
-1.01
Efq
-1.00
Majefty
-0.97
ſind
-0.96
)";
-0.94
POSITIVE LOGITS
"
0.88
“
0.84
“
0.73
<eos>
0.72
I
0.69
.
0.68
'
0.67
'
0.66
,"
0.65
"
0.65
Activations Density 0.048%