INDEX
    Explanations

    various forms of the word "accept" and its related expressions

    Negative Logits
    Token       Logit
    "859"       -0.19
    "zÅij"      -0.15
    "lá"        -0.15
    "abouts"    -0.14
    "lev"       -0.14
    "omor"      -0.14
    "urai"      -0.14
    "ahl"       -0.14
    "esis"      -0.14
    "ASA"       -0.14
    Positive Logits
    Token                Logit
    " offered"           0.24
    "offer"              0.23
    " responsibility"    0.22
    " offers"            0.20
    "offers"             0.19
    "challenge"          0.19
    " challenge"         0.19
    " premise"           0.18
    " Responsibility"    0.18
    " offer"             0.18
    Activation Density: 0.119%

    No Known Activations