INDEX
    Explanations

    references to the name "John."

    New Auto-Interp
    Negative Logits
    hl
    -0.17
     Franti
    -0.16
    opsis
    -0.16
    opup
    -0.15
    quisites
    -0.15
    ycz
    -0.15
     mund
    -0.15
    -mf
    -0.14
    زار
    -0.14
    Nom
    -0.14
    POSITIVE LOGITS
     Bapt
    0.20
    annes
    0.18
    nes
    0.18
     XX
    0.17
    nie
    0.17
     Baptist
    0.17
    NES
    0.17
    stone
    0.16
     Cab
    0.15
    Cab
    0.15
    Act Density 0.026%

    No Known Activations