INDEX
    Explanations

    words containing the substring "irm"

    variations of the word "affirm" and its derivatives

    New Auto-Interp
    Negative Logits
    REDACTED
    -0.75
    ç«
    -0.67
    å¹
    -0.66
     Cong
    -0.63
    Rus
    -0.62
    å¥
    -0.62
     largeDownload
    -0.62
     Ree
    -0.62
    lihood
    -0.62
    HY
    -0.61
    POSITIVE LOGITS
    irm
    1.75
    irms
    1.10
    irmed
    1.00
    ament
    0.99
    atively
    0.97
    etric
    0.95
    irmation
    0.91
    aton
    0.91
    ative
    0.89
    ities
    0.86
    Act Density 0.007%

    No Known Activations