    Negative Logits
    Token          Logit
    " rash"        -0.08
    (no token)     -0.08
    "ناسبة"        -0.08
    "alee"         -0.07
    "Expert"       -0.07
    "صاب"          -0.07
    "463"          -0.07
    " sat"         -0.07
    " schrift"     -0.07
    " induct"      -0.07
    Positive Logits
    Token            Logit
    " toget"         0.08
    " Seks"          0.08
    " Tung"          0.08
    " nou"           0.08
    " praten"        0.07
    " communicate"   0.07
    ".Exchange"      0.07
    " teaching"      0.07
    " Ss"            0.07
    "rates"          0.07
    Activation Density: 0.000%

    No Known Activations