INDEX
    Explanations

    phrases related to controversial topics and opinions

    Negative Logits
    ngth          -0.78
    luaj          -0.73
    opez          -0.71
    ividual       -0.68
    aneers        -0.67
    ifferent      -0.62
    perty         -0.61
    ebted         -0.60
    isively       -0.60
    vertisements  -0.60
    Positive Logits
     happening          0.91
     true               0.82
     understandable     0.81
    quickShipAvailable  0.80
     why                0.79
     compounded         0.79
     untrue             0.76
     blasphemy          0.74
     reassuring         0.73
    SPONSORED           0.72
    Activation Density: 3.905%

    No Known Activations