INDEX
Explanations
references to the name "John."
New Auto-Interp
Negative Logits
hl
-0.17
Franti
-0.16
opsis
-0.16
opup
-0.15
quisites
-0.15
ycz
-0.15
mund
-0.15
-mf
-0.14
زار
-0.14
Nom
-0.14
POSITIVE LOGITS
Bapt
0.20
annes
0.18
nes
0.18
XX
0.17
nie
0.17
Baptist
0.17
NES
0.17
stone
0.16
Cab
0.15
Cab
0.15
Activations Density 0.026%