INDEX
    Explanations

    references to singular entities or concepts

    New Auto-Interp
    Negative Logits
    inem
    -0.68
    primary
    -0.64
    current
    -0.63
    apse
    -0.62
    ãĤ¼ãĤ¦ãĤ¹
    -0.60
    arden
    -0.60
    mun
    -0.60
    Ͻ
    -0.59
    accompan
    -0.59
    access
    -0.58
    POSITIVE LOGITS
     mention
    0.84
    icable
    0.83
     bothering
    0.76
    LESS
    0.76
     ounce
    0.73
     nor
    0.73
     kidding
    0.73
     dime
    0.71
    ches
    0.69
    icably
    0.68
    Act Density 0.072%

    No Known Activations