INDEX
    Explanations

    words related to permission or authorization

    the phrase "wouldn't" and its variations

    New Auto-Interp
    Negative Logits
    ocal
    -0.68
    ARM
    -0.67
    æ³
    -0.65
    core
    -0.65
    story
    -0.63
     Case
    -0.63
    Dialog
    -0.63
     Gong
    -0.61
    agency
    -0.61
    dress
    -0.60
    POSITIVE LOGITS
    't
    1.19
     never
    0.84
     proble
    0.84
     surely
    0.82
    terness
    0.77
    ÃĥÃĤ
    0.76
     hardly
    0.76
     tremend
    0.76
     adjourn
    0.75
    itiveness
    0.75
    Act Density 0.011%

    No Known Activations