INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ین
    1.14
    ב
    1.11
    ために
    1.03
    0.93
    }{
    0.92
    Officers
    0.91
    ک
    0.90
    ש
    0.90
    August
    0.89
    )}
    0.88
    POSITIVE LOGITS
    ור
    1.38
    1.04
     divulg
    1.03
     configur
    0.97
     vend
    0.96
     cré
    0.96
     trifling
    0.95
     protracted
    0.93
     financi
    0.93
    ()=>{
    0.92
    Act Density 0.144%

    No Known Activations