INDEX
    Explanations

    patterns related to suffixes or endings of words

    New Auto-Interp
    Negative Logits
     DRAG
    -0.69
    ailability
    -0.66
     substance
    -0.64
    ãĥīãĥ©ãĤ´ãĥ³
    -0.62
    llah
    -0.62
    PLIC
    -0.61
    iannopoulos
    -0.61
     colleges
    -0.61
     scant
    -0.59
     concess
    -0.59
    POSITIVE LOGITS
    kamp
    0.92
    furt
    0.91
    adder
    0.89
    wagen
    0.86
     sidx
    0.77
    tsy
    0.77
    erella
    0.73
    fried
    0.73
    velt
    0.69
    bourg
    0.68
    Act Density 0.018%

    No Known Activations