INDEX
    Explanations

    phrases or words indicating negation or denial

    the phrase "ain't," indicating informal or colloquial expressions

    NEGATIVE LOGITS
    "INC"                 -0.70
    "Impl"                -0.63
    " Agency"             -0.63
    "ULT"                 -0.62
    " Promotion"          -0.62
    "=-=-=-=-=-=-=-=-"    -0.62
    "EV"                  -0.61
    "ersen"               -0.61
    " Carbuncle"          -0.61
    "IAN"                 -0.61
    POSITIVE LOGITS
    "'t"          0.96
    " ain"        0.92
    "gin"         0.88
    " gonna"      0.83
    "\\\\\\\\"    0.83
    "strument"    0.82
    "ãĥ³ãĤ¸"      0.82
    "ny"          0.77
    "ga"          0.76
    "thouse"      0.76
    Act Density 0.008%

    No Known Activations
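
The positive-logit entry `ãĥ³ãĤ¸` looks like a token shown in the printable-byte alphabet of a GPT-2 style byte-level BPE vocabulary, not like display corruption. The sketch below assumes that tokenizer family, which the page itself does not state, and maps such a string back to readable text. The helper name `decode_token` is hypothetical; `bytes_to_unicode` follows the byte-to-character table commonly used by GPT-2 style tokenizers.

```python
def bytes_to_unicode():
    """Byte -> printable character table used by GPT-2 style byte-level BPE."""
    # Bytes that are already printable map to themselves.
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to an unused code point from 256 upward.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


# Inverse table: printable character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Hypothetical helper: turn a vocabulary string back into UTF-8 text."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # Single tokens can end mid-character, so replace undecodable bytes.
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĥ³ãĤ¸"))  # → ンジ
```

Under that assumption the entry corresponds to the katakana pair `ンジ`. Tokens made of plain ASCII characters, such as `gin` or `ny`, pass through unchanged.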