INDEX
    Explanations

    names starting with Fran, Franc, Frank

    Negative Logits
    cortina     -0.83
    boîte       -0.79
     atop       -0.75
     where      -0.75
     ухода      -0.73
    ודי         -0.70
     young      -0.69
    Trung       -0.69
     hinted     -0.69
     pfle       -0.69
    Positive Logits
     fran       1.02
    Fran        0.99
     Fran       0.91
     franc      0.88
    kende       0.87
    fran        0.84
     Laughter   0.84
    ">-         0.83
     Franken    0.82
    imanapun    0.82
    Act Density 0.012%

    No Known Activations