INDEX
Explanations
references to chaplains and related terminology
New Auto-Interp
Negative Logits
essel
-0.16
izzato
-0.16
pitch
-0.15
üf
-0.15
orf
-0.15
ön
-0.15
es
-0.14
rib
-0.14
entifier
-0.14
ously
-0.14
POSITIVE LOGITS
lain
0.29
chap
0.23
chap
0.22
itre
0.20
Chap
0.19
anooga
0.18
Stick
0.18
AGAIN
0.18
elize
0.18
lin
0.17
Activations Density 0.004%