INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Norris
    -0.55
     متعلقه
    -0.54
    apati
    -0.49
    RunWith
    -0.49
    Vidu
    -0.44
     trusted
    -0.43
    creativecommons
    -0.43
    uta
    -0.42
    ΤΗ
    -0.42
     Gin
    -0.42
    POSITIVE LOGITS
     ProtoMessage
    0.78
    PreferredItem
    0.67
    verwijspagina
    0.66
     kasarigan
    0.63
    Personensuche
    0.63
     EClass
    0.63
    Tembelea
    0.61
    Демографія
    0.61
     FetchType
    0.60
    testify
    0.60
    Act Density 0.221%

    No Known Activations