INDEX
    Explanations

    references to gambling or betting activities

    New Auto-Interp
    Negative Logits
    dre
    -0.17
    submenu
    -0.15
    yı
    -0.15
    stal
    -0.14
    mey
    -0.14
    writing
    -0.13
    exus
    -0.13
    anki
    -0.13
     matter
    -0.13
    åĴ²
    -0.13
    POSITIVE LOGITS
    ãģĬãĤĬ
    0.16
    ÏħÏĦÏĮ
    0.14
    ting
    0.14
    ply
    0.14
    eph
    0.14
    tors
    0.14
    cha
    0.14
    inning
    0.14
    licative
    0.13
    ixo
    0.13
    Act Density 0.020%

    No Known Activations