    Negative Logits
    Token              Logit
    (token missing)    -0.72
    "ніципалі"         -0.71
    " ब्रेकडाउन"         -0.60
    " nakalista"       -0.57
    " propOrder"       -0.57
    "prefixer"         -0.55
    " Vikipedi"        -0.54
    " الحره"           -0.54
    " indígen"         -0.53
    "konomi"           -0.51
    Positive Logits
    Token              Logit
    " ever"            0.47
    " anywhere"        0.45
    " at"              0.45
    " all"             0.42
    "setAll"           0.41
    "esgue"            0.40
    " Dor"             0.39
    " quaisquer"       0.39
    " any"             0.39
    " All"             0.38
    Activation Density: 0.009%

    No Known Activations