INDEX
    Explanations

    phrases indicating frequency or conditions regarding events or actions

    Negative Logits
    Obrigada       -0.52
    onade          -0.51
    وردار          -0.48
    nac            -0.48
     numberWith    -0.47
     ACKNOWLEDG    -0.47
    enseits        -0.47
    ostom          -0.46
     Occidente     -0.46
    momix          -0.45
    Positive Logits
    0.88
    SharedCtor       0.87
     lenker          0.83
     كومونز          0.82
    GEBURTSDATUM     0.75
    IUrlHelper       0.75
    CppMethod        0.73
    :✨               0.70
    Personensuche    0.69
     IBOutlet        0.69
    Activation Density 0.130%

    No Known Activations