INDEX
    Explanations

    the presence of the names "Jean" and "Jane."

    New Auto-Interp
    Negative Logits
    RenderAtEndOf
    -1.02
     myſelf
    -1.01
     itſelf
    -1.01
     Monfieur
    -0.92
     nahilalakip
    -0.91
     Eſ
    -0.89
     يتيمه
    -0.89
     Efq
    -0.89
    曖昧さ回避
    -0.88
    AndEndTag
    -0.88
    POSITIVE LOGITS
     Jane
    1.87
     Jean
    1.65
    Jane
    1.64
    Jean
    1.43
    Jan
    1.42
     Jan
    1.39
    jan
    1.35
     JAN
    1.27
     JEAN
    1.27
     jane
    1.26
    Act Density 0.051%

    No Known Activations