INDEX
    Negative Logits
    Personensuche
    -0.82
     تضيفلها
    -0.76
    TagMode
    -0.75
     ostavi
    -0.70
     استنادى
    -0.68
    -0.68
     disponibilités
    -0.67
    rungsseite
    -0.67
    saraba
    -0.66
     public
    -0.66
    Positive Logits
    Literatuur
    0.42
     conservatives
    0.40
    TagHelper
    0.40
    vous
    0.39
    seni
    0.39
     Lue
    0.39
    shi
    0.38
     expiring
    0.38
     lụ
    0.38
     andato
    0.38
    Activation Density: 0.025%

    No Known Activations