INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     dirty
    -0.06
    Switch
    -0.06
     خارج
    -0.06
    -A
    -0.06
    گران
    -0.06
     exchanging
    -0.06
    EqualTo
    -0.06
    icari
    -0.06
    _summary
    -0.06
     خر
    -0.06
    POSITIVE LOGITS
     Guinness
    0.07
    0.07
    ija
    0.07
     specifying
    0.06
     departing
    0.06
     Kohana
    0.06
     '__
    0.06
     Economist
    0.06
    leine
    0.06
     GW
    0.06
    Act Density 0.041%

    No Known Activations