INDEX
    Explanations

    calculations

    New Auto-Interp
    Negative Logits
     inför
    -0.07
     MX
    -0.07
    IPO
    -0.07
    ioa
    -0.07
     escalate
    -0.07
     coba
    -0.07
     Jensen
    -0.07
     mux
    -0.07
    ája
    -0.07
     chor
    -0.07
    POSITIVE LOGITS
     fewer
    0.10
     spared
    0.09
     surviving
    0.09
     quinze
    0.09
     subset
    0.09
     eighty
    0.09
    subset
    0.08
     منهم
    0.08
     sixty
    0.08
     كامل
    0.08
    Act Density 0.049%

    No Known Activations