    Explanations

    names of people and their associated actions or statuses

    Negative Logits
    Token               Logit
    " _"                -0.06
    " unto"             -0.06
    "ould"              -0.06
    "sd"                -0.06
    " import"           -0.06
    "etic"              -0.06
    "sx"                -0.06
    "ym"                -0.06
    "都会"              -0.06
    (no token shown)    -0.05
    Positive Logits
    Token               Logit
    "onis"              0.07
    "ليل"               0.07
    "uese"              0.07
    "ENDOR"             0.07
    "aye"               0.07
    " لدي"              0.07
    "endale"            0.07
    " 評価"             0.07
    " благод"           0.07
    "uffer"             0.07
    Activation Density: 0.023%

    No Known Activations