Explanations
proper nouns, particularly names and references to individuals
Negative Logits
Diſ        -1.21
houſe      -1.20
Theſe      -1.20
ſelf       -1.16
becauſe    -1.15
itſelf     -1.15
myſelf     -1.12
Houſe      -1.10
Eſ         -1.09
―――――      -1.09
Positive Logits
Von        0.93
von        0.90
Von        0.82
Steph      0.76
Steph      0.74
machus     0.72
Ste        0.71
Stephens   0.69
degré      0.67
ccc        0.67
Activations Density: 0.514%