INDEX
    Explanations

    words related to limits or restrictions

    terms related to limits or restrictions

    New Auto-Interp
    Negative Logits
    avi
    -0.79
    otive
    -0.78
    amara
    -0.75
    agent
    -0.71
    ee
    -0.70
    thora
    -0.69
    indust
    -0.68
    ives
    -0.67
    iquette
    -0.67
    Reply
    -0.66
    POSITIVE LOGITS
     capped
    0.99
     caps
    0.73
     pegged
    0.72
     compens
    0.68
    stan
    0.68
     Dunn
    0.65
     nickel
    0.65
    llan
    0.65
    locked
    0.63
    ULAR
    0.62
    Act Density 0.012%

    No Known Activations