INDEX
    Explanations

    phrases related to unauthorized actions or behaviors

    terms related to unauthorized activities or entities

    New Auto-Interp
    Negative Logits
     dyn
    -0.77
    auga
    -0.76
    uin
    -0.72
     Gamble
    -0.71
    idon
    -0.71
    win
    -0.69
    bourne
    -0.67
    stadt
    -0.66
    amide
    -0.66
     Commit
    -0.64
    POSITIVE LOGITS
     unauthorized
    2.52
    ör
    1.54
    eties
    1.51
     infrared
    1.36
    rared
    1.16
    authorized
    1.09
    Pocket
    0.95
     Pocket
    0.95
    youtube
    0.92
     Ghostbusters
    0.89
    Act Density 0.031%

    No Known Activations