INDEX
    Explanations

    words indicating capability or suitability, particularly those ending in 'able'

    New Auto-Interp
    Negative Logits
    ing
    -0.08
    ed
    -0.08
    ese
    -0.08
    ãĤ¥
    -0.07
    egal
    -0.07
    asso
    -0.07
    edb
    -0.07
    arily
    -0.07
    kowski
    -0.07
    el
    -0.07
    POSITIVE LOGITS
    heid
    0.09
    -bodied
    0.09
     Jar
    0.07
    uchar
    0.07
    ilty
    0.07
    yg
    0.07
    머ëĭĪ
    0.07
    ãĥ¼ãĥĹ
    0.07
    ipsis
    0.06
    Jar
    0.06
    Act Density 0.072%

    No Known Activations