Explanations
large numbers, especially those including exponents, percentages, or those that specify ratios or measurements
potentially archaic texts
Negative Logits
(blank)   -1.20
,         -1.16
.         -1.14
(         -1.14
↵         -1.13
<eos>     -1.08
↵↵        -1.08
-         -1.07
2         -1.07
'         -1.06
Positive Logits
Efq        2.13
Theſe      2.03
myſelf     1.98
Monfieur   1.88
itſelf     1.85
Jefus      1.77
ſelf       1.76
purpoſe    1.74
pleaſure   1.74
Anſ        1.73
Activations Density 2.212%