INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     reach
    -0.08
    เจ
    -0.08
     olymp
    -0.08
     brill
    -0.07
     DSL
    -0.07
     salga
    -0.07
     pastries
    -0.07
    silver
    -0.07
     બનાવ
    -0.07
     Schw
    -0.07
    POSITIVE LOGITS
     общения
    0.12
     verbal
    0.11
    0.09
     ileti
    0.09
     verbally
    0.09
     التواصل
    0.09
    0.09
    交流
    0.09
    েক্স
    0.08
     nói
    0.08
    Act Density 0.008%

    No Known Activations