INDEX
    Explanations

    phrases that indicate existence or presence

    New Auto-Interp
    Negative Logits
    ########.
    -0.64
    Хьажоргаш
    -0.61
    especie
    -0.60
    trzyma
    -0.59
    Havolalar
    -0.59
    Namara
    -0.56
    zędu
    -0.55
    cifix
    -0.54
    rillation
    -0.54
    UnitTesting
    -0.54
    POSITIVE LOGITS
    SequentialGroup
    0.76
    fjspx
    0.75
    ftagPool
    0.75
     الحره
    0.74
    Hentet
    0.73
     propOrder
    0.71
     newBuilder
    0.69
    windowFixed
    0.69
    AndEndTag
    0.69
    الدراسه
    0.67
    Act Density 0.007%

    No Known Activations