INDEX
Explanations
references to scientific or technical terminology
New Auto-Interp
Negative Logits
itſelf
-1.15
Jefus
-1.06
ſelves
-1.01
myſelf
-1.00
tartalomajánló
-0.98
Majefty
-0.97
Cæsar
-0.97
faſt
-0.97
ſelf
-0.97
Anſ
-0.95
POSITIVE LOGITS
"
0.65
,
0.52
...
0.50
nemen
0.49
“
0.49
I
0.48
.
0.43
he
0.43
aient
0.42
A
0.42
Activations Density 0.013%