INDEX
    Explanations

    equals sign

    New Auto-Interp
    Negative Logits
     intimate
    -0.09
     концент
    -0.08
     slechts
    -0.08
     فا
    -0.08
     بیشتر
    -0.08
     gelegenheid
    -0.08
     صورت
    -0.08
    achievement
    -0.08
    artan
    -0.08
     amput
    -0.08
    POSITIVE LOGITS
     
    0.09
    .lib
    0.08
     Mas
    0.08
    0.07
    0.07
    0.07
    Mas
    0.07
     mas
    0.07
    å
    0.07
    .val
    0.07
    Act Density 0.096%

    No Known Activations