INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    TargetException
    -0.31
     atx
    -0.31
     eng
    -0.30
    дента
    -0.29
     esc
    -0.29
     daz
    -0.28
    fxml
    -0.28
     alp
    -0.27
     res
    -0.27
     Aires
    -0.27
    POSITIVE LOGITS
    KommentareTeilen
    0.77
     utafitiHapana
    0.63
    sonaro
    0.58
    Билгалдахарш
    0.56
     Bedien
    0.55
     vecinos
    0.54
    nestjs
    0.53
     dipendenti
    0.52
     Wikimedijinoj
    0.52
    uesia
    0.52
    Act Density 0.025%

    No Known Activations