INDEX
Explanations
references to errors and warnings, especially related to file or access issues
German words or phrases
German articles followed by nouns
New Auto-Interp
Negative Logits
houſe
-0.99
wikipagina
-0.99
pleaſure
-0.96
Houſe
-0.93
ſche
-0.92
purpoſe
-0.92
Shakspeare
-0.86
Majefty
-0.86
raiſ
-0.84
Cæsar
-0.83
POSITIVE LOGITS
same
1.11
entire
1.05
most
1.03
following
0.92
whole
0.90
latter
0.89
rest
0.89
main
0.86
“
0.84
latest
0.81
Activations Density 0.007%