    Negative Logits
    Token            Logit
    "iêu"            -0.07
    " Kosten"        -0.06
    " solidarity"    -0.06
    "(App"           -0.06
    "ضوع"            -0.06
    "ствие"          -0.06
    "sunuz"          -0.06
    "_server"        -0.06
    "lemetry"        -0.06
    ".getRight"      -0.06
    Positive Logits
    Token            Logit
    " LGBT"          0.18
    " LGBTQ"         0.18
    "GBT"            0.11
                     0.07
    " leaving"       0.07
    " п"             0.07
    "/trunk"         0.07
    "องท"            0.07
    " đàn"           0.07
    " youth"         0.07
    Activation Density: 0.002%

    No Known Activations