INDEX
    Explanations

    phrases related to promoting or advertising something

    New Auto-Interp
    Negative Logits
    quartered
    -0.76
    sbm
    -0.69
    */(
    -0.68
     ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
    -0.68
    assian
    -0.66
    fuck
    -0.66
    falls
    -0.66
     Detected
    -0.66
    asso
    -0.65
    kes
    -0.65
    POSITIVE LOGITS
     awareness
    0.88
     abstinence
    0.83
    andise
    0.81
    entious
    0.80
     entrepreneurship
    0.80
     equality
    0.79
     democracy
    0.79
     intolerance
    0.78
     excellence
    0.78
    ably
    0.77
    Act Density 0.070%

    No Known Activations