INDEX
    Explanations

    references to promotional products and branding

    New Auto-Interp
    Negative Logits
    igon
    -0.15
     pseud
    -0.14
    iku
    -0.14
    aller
    -0.14
    actors
    -0.14
     Liberation
    -0.14
    ä¸ģ
    -0.13
    ãĥ³ãĥĹ
    -0.13
    ứng
    -0.13
    ilon
    -0.13
    POSITIVE LOGITS
     Prom
    0.30
     promotional
    0.29
     promo
    0.28
    Prom
    0.27
     Promo
    0.25
    prom
    0.25
     Promotion
    0.25
     promotion
    0.24
     promote
    0.23
     prom
    0.23
    Act Density 0.038%

    No Known Activations