INDEX
    Explanations

    phrases including the word "leader."

    occurrences of the suffix "er" in various contexts

    New Auto-Interp
    Negative Logits
    ĸļ
    -0.96
    acebook
    -0.71
    chwitz
    -0.69
    eenth
    -0.69
    raltar
    -0.68
    atform
    -0.65
    luaj
    -0.62
    urities
    -0.62
    ertodd
    -0.61
    uncture
    -0.60
    POSITIVE LOGITS
    jee
    1.01
    lein
    0.95
    idge
    0.88
    ger
    0.86
    adish
    0.84
    lich
    0.82
    rors
    0.82
    aton
    0.81
    baum
    0.79
    asures
    0.79
    Act Density 0.051%

    No Known Activations