INDEX
    Explanations

    words ending in 'ed'

    the past tense form of verbs

    New Auto-Interp
    Negative Logits
    女
    -0.86
    FTWARE
    -0.79
     Leilan
    -0.78
    å£
    -0.78
    ORY
    -0.71
    ãĥ¯ãĥ³
    -0.71
    åħī
    -0.68
    itars
    -0.67
    BOOK
    -0.64
     caliber
    -0.64
    POSITIVE LOGITS
    ict
    0.89
    nesday
    0.89
    uct
    0.88
    ging
    0.83
    uled
    0.83
    own
    0.83
    dit
    0.81
    ded
    0.80
    usa
    0.79
    ding
    0.79
    Act Density 0.072%

    No Known Activations